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Process for preparing amino acids 

The present invention relates to a process for preparing amino acids in transgenic organisms. 

The invention further relates to nucleic acid constructs, vectors and transgenic organisms, and 
to the use thereof. 

5 Amino acids form the basic structural unit of ail proteins and are thus essential for normal cell 
functions. The term "amino acid" is known in the art. The proteogenic amino acids, of which 
there are 20 types, serve as structural units for proteins in which they are linked together via 
peptide bonds, whereas the non-proteogenic amino acids (of which hundreds are known) usually 
do not occur in proteins [see Ullmann's Encyclopedia of Industrial Chemistry, Vol. A2, 

10 pages 57-97 VCH: Weinheim (1985)]. The amino acids can exist in the D or L configuration, 
although L-amino acids are usually the only type found in naturally occurring proteins. 
Biosynthetic and degradation pathways of each of the 20 proteogenic amino acids are well 
characteri7erl hoth in prokaryotic and eukaryotic ceils (see, for example, Stryer, L. Biochemistry, 
. 3rd edition, pages 578-590 (1988)). The "essential" amino acids (histidine, isoleucine, leucine, 

15 lysine, methionine, phenylalanine, threonine, tryptophan and vaiine), so called because they 

must be obtained through the diet because of the complexity of their biosynthesis, are converted 
by simple biosynthetic pathways into the other 1 1 "nonessential" amino acids (alanine, arginine, 
ooparagine, aspartic acid, cysteine, glutamic acid, glutamine, glycine, proline, serine and 
tyrosine). Higher animals have the ability to synthesize some of these amino acids, but the 

20 essential amino acids must be obtained from food for normal protein synthesis to take place. 

Amino acids are used in many branches of industry, including the human and animal food, 
cosmetics, pharmaceutical and chemical industries. Thus, L-glutamic acid is used for example in 
infusion solutions. Amino acids such as D,L-methionine, L-lysine or L-threonine are used in the 
animal food industry. Particularly important for the diet of humans and many useful animals are 

25 the essential amino acids vaiine, leucine, isoleucine, lysine, threonine, methionine, tyrosine, 
phenylalanine and tryptophan. Thus, for example, lysine is an important amino acid not only for 
the human diet but also for monogastric animals such as poultry and pigs. L-Lysine is the 
limiting amino acid in plants such as corn or wheat, which is to say that in order to enable 
optimal utilization of such plant food it is sensible to supplement the human or animal food with 

30 L-lysine. Glutamate is most frequently used as flavor additive (monosodium glutamate, MSG) 
and is used widely in the food industry, as are aspartate, phenylalanine, glycine and cysteine. 
Glycine, L-methlonine and tryptophan are ail used in the pharmaceutical industry. Glutamine, 
valine, leucine, isoleucine, histidine, arginine, proline, serine and alanine are used in the 
pharmaceutical industry and the cosmetics industry. Threonine, tryptophan and D/L-methionine 

35 are widely used animai food additives [Leuchtenberger, W. (1 996) Amino acids - technical 

production and use, pages 466-502 in Rehm et al., (editors) Biotechnology Vol. 6, Chapter 14a, 
20020960 UP/12.20.2002 Sec l 
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VCH: Weinheim]. In addition, amino acids are suitable for the chemical industry as precursors 
for synthesizing synthetic amino acids and proteins such as N-acetylcysteine, S-carboxymethyl- 
L-cysteine, (S)-5-hydroxytryptophan and other substances described in SJilmann's Encyclopedia 
of Industrial Chemistry, Vol. A2, pages 57-97, VCH, Weinheim, 1985. 

5 The annual production of amino acids currently amounts to over 1 million t/a with a market value 
of more than 2 billion US$. They are at present produced by four competing processes: 

1 . extraction from protein hydrolysates, for example of L-cystine, L-leucine or L-tyrostne, 

2. chemical synthesis, for example of D,L-methionine, 

3. conversion of chemical precursors in an enzyme or cell reactor, for example L- 
10 phenylalanine and 

4. fermentative preparation by larrjp-^lfi culturing of bacteria developed in order to 
produce and separate large amounts of the particular desired molecule. An organism 
particularly suitable for this purpose is Corynebacterium glutamicum, which is used for 
example to prepare L-lysine or L-glutamic acid. Further examples of amino acids 

15 prepared by fermentation are L-threonine, L-tryptophan, L-aspartic acid and L- 

phenylalanine. 

The biosynthesis of natural amino acids in organisms able to produce them, for example 
bacteria, has been well characterized [for a review of bacterial amino acid biosynthesis and its 
regulation, see Umbarger, H.E. (1978) Ann. Rev. Biochem. 47: 533 - 606], Glutamic acid is 

20 synthesized by reductive amination of a-ketoglutarate, an intermediate in the citric acid cycle. 
Glutamine, proline and arginine are each produced successively from glutamate. The 
biosynthesis of serine takes place in a three-step process starting with 3-phosphoglycerate (a 
glycolysis intermediate) and resulting, after oxidation, transamination and hydrolysis steps, in 
this amino acid. Cysteine and glycine are each produced from serine, the former by 

25 condensation of homocysteine with serine, and the latter by transfer of the side-chain p carbon 
atom to tetrahydrofolate in a reaction catalyzed by serine transhydroxymethyiase. Phenylalanine 
and tyrosine are synthesized from the precursors of the glycolysis pathway and pentose 
phosphate pathway, erythrose 4-phosphate and phosphoenolpyruvate in a 9-step biosynthettc 
pathway differing only in the last two steps after the synthesis of prephenate. Tryptophan is 

30 likewise produced from these two starting molecules but it is synthesized in an 1 1 -step pathway. 
Tyrosine can aiso be produced from phenylalanine in a reaction catalyzed by phenylalanine 
hydroxylase. Alanine, valine and leucine are each hinsynthetic products of pyruvate, the final 
product of glycolysis. Aspartic acid is formed from oxalacetate, an intermediate in the citric acid 
cycle. Asparagine, methionine, threonine and lysine are each produced by conversion of aspartic 
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acid. Isoleucine is formed from threonine. Histidine is formed from 5-phosphoribosyl 1- 
pyrophosphate, an activated sugar, in a complex 9-step pathway. 

The preparation of amino acids by fermentation of strains of coryneform bacteria, especially 
Corynebacterium glutamicum, is known. Because of the great importance, there is continuous 
5 work on improving the existing preparation processes. Process improvements may relate to 
measures of fermentation technique, such as, for example, stirring and oxygen supply, or the 
composition of the nutrient media, such as, for example, the sugar concentration during the 
fermentation, or the working up to the product, for example by ion exchange chromatography, or 
the intrinsic production properties of the microorganism itself. Bacteria of other genera such as 
10 Escherichia or Bacillus are also used for preparing amino acids. 

A number of mutant strains producing a range of desirable compounds from the series of sulfur- 
containing fine chemicals have been developed by strain selection. The methods used to 
improve the production properties of these microorganisms in terms of the production of a 
particular molecule are those of mutagenesis, selection and choice of mutants. This is, however, 

1 5 a time-consuming and difficult process. EP-A-0 066 129 describes by way of example a process 
for preparing threonine using oorynebacteria. Corresponding processes have also been 
elaborated for preparing methionine. In this way, for example, strains are obtained which are 
resistant to antimetabolites such as, for example, the methionine analogs a-methylmethionine, 
ethionine, norleucine, N-acetylnorleucine, S-trifluoromethyihomocysteine, 2-amino-5-heprenoit 

20 acid, selenomethionine, methionine sulfoximtne, methoxine, 1-aminocyciopentanecarboxyiic 
acid or auxotrophic for metabolites of regulatory importance and produce sulfur-containing fine 
chemicals such as, for example, L-methionine. Processes of this type developed for preparing 
methionine have the disadvantage that the yields are too low for economic utilization and they 
are therefore unable to compete with chemical synthesis. 

25 Zeh el al. (Plant Physiol., Vol. 127, 2001: 792 802) describe an increase in the methinnine 
content in potato plants through inhibition of threonine synthase by so-called antisense 
technology. This leads to a reduced activity of threonine synthase without reducing the threonine 
content in the plants. It is disadvantageous that this technology is very complicated and can be 
used only very poorly on an industrial scale, if at all. In addition, there must be highly 

30 differentiated inhibition ot the enzymic activity because, oltiei wise, an auxotrophy for the amino 
acid occurs and the plant no longer grows. 

Methods of recombinant DNA technology have likewise been employed for some years for strain 
improvement for L-amino acid-producing Corynebacterium strains by amplifying individual amino 
acid biosynthesis genes and examining the effect on amino acid production. 
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Amounts of amino acids exceeding the protein biosynthesis requirements of the cell cannot be 
stored and are instead degraded, so that intermediates are provided for the main metabolic 
pathways of the cell [for a review, see Stryer, L, Biochemistry, 3rd edition, Chapter 21 "Amino 
. Acid Degradation and the Urea Cycle"; pages 495-516 (1988)]. Although the cell is able to 
5 convert unwanted amino acids into useful metabolic intermediates, amino acid production is 
costly in terms of energy, the precursor molecules and the enzymes necessary for their 
synthesis. It is therefore not surprising that amino acid biosynthesis is controlled by feedback 
inhibition, with the presence of a particular amino acid slowing down or entirely terminating its 
own production [for a review of the feedback mechanism in amino acid biosynthetic pathways, 
1 0 see Stryer, L, Biochemistry, 3rd edition, Chapter 24, "Biosynthesis of Amino Acids and Heme", 
pages 575-600 (1988)]. The output of a particular amino acid is therefore restricted by the 
amount of this amino acid in the cell. 

Improvements in the preparation of fine chemicals by fermentation usually correlate with 
improvements in substance fluxes and yields. It ie important in this connection to prevent or 
1 5 reduce inhibition of important synthetic enzymes by intermediates or final products. It is likewise 
advantageous to prevent or reduce wastage of the carbon flux in unwanted products or side 
products. 

The essential amino acids are, as described above, necessary for humans and many mammals, 
for example for domestic animals. L-Methionine is important in this connection as methyl group 
20 donor for the biosynthesis of, for example, choline, creatine, adrenaline, bases and RNA and 
DMA, histidine, and for transmethylation after formation of S-adenosylmethionine or as sutfhydryl 
group donor for cyst formation. 

L-Methionine additionally appears to have a beneficial effect on depressions. 

Improvement in the quality of human and animal foods is therefore an important task of the 
25 human and animal food industries. This is necessary because, for example, amino acids such 
as L-lysine and L-tryptophan in plants are limiting for the supply to mammals. An amino acid 
pattern which is as balanced as possible is particularly advantageous for the quality of human 
and animal foods, because a large excess of one amino acid such as, for example, L-lysine has, 
above a particular concentration in the foodstuff, no further beneficial effect on the utilization of 
30 the foodstuff, because other amino acids suddenly become limiting. A further increase in the 
quality is possible only by adding further amino acids which are limiting under these conditions. 
Thus, in growing pigs, lysine is initially limiting. If the food contains sufficient lysine, threonine 
becomes the limiting amino acid. If threonine is also added sufficiently to the food, tryptophan is 
limited as next amino acid. The sequence of the first three limiting amino acids for chickens is as 
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follows: methionine, lysine and then threonine. This shows that these amino acids have an 
important function for optimal nutrition and must be present in a balanced ratio in the diet. 

Great care is therefore necessary in specific dosage of the limiting amino acid in the form of 
synthetic products in order to avoid amino acid imbalances. This is because addition of an 
5 essential amino acid stimulates protein digestion, which may elicit in particular deficiency 
situations for limiting amino acid in second or third place. 

Thus, in feeding trials for example of casein with additional doses of methionine, which is limited 
in casein, fatty degeneration of the liver has been found and could be eliminated only after 
additional dosage of tryptophan. 

1 0 A balanced addition of a plurality of amino acids is therefore necessary for high quality of human 
and animal food, depending on the organism. The aforementioned fermentative and other 
synthetic processes usually make it possible to obtain only one amino acid. 

It is an object of the present invention to develop a cost-effective process for synthesizing amino 
acids, advantageously the essential amino acids L-lysine and L-methionine, which are among 
1 5 the two most common limiting amino acids. 

We have found that this object is achieved by the process of the invention for preparing amino 
acids in transgenic organisms, wherein the process comprises the following steps: 

a) introduction of a nucleic acid sequence which codes for a threonine- 

degrading protein, or 

20 b) introduction of a nucleic acid sequence which increases threonine 

degradation in the transgenic organisms, and 

c) expression of a nucleic acid sequence mentioned under (a) or (b) in the 

transgenic organism. 

Threonine-degrading proteins advantageously mean proteins such as threonine aldolase or 
25 serine hydroxymethyltransferase, which convert threonine into acetaldehyde and glycine, 

threonine dehydrogenase which converts threonine into L-2-aminoacetoacetate with formation of 
NADH + H + , or threonine dehydratase which converts threonine into oxobutyrate with elimination 
of NH 3 and water. Threonine aldolase is advantageously used as thronine-degrading activity in 
the process of the invention. The activity of the aforementioned proteins and/or of the nucleic 
30 acid sequences coding for them can be increased in various ways. The nucleic acid sequences 
are advantageously expressed in an organism, and thus the activity in an organism is increased 
via the gene copy number and/or else the stability of the expressed mRNA is increased and/or 
the stability of the gene product is increased. A further possibility is to change the regulation of 
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the aforementioned nucleic acid sequences so that expression of the genes is increased. This 
can advantageously be achieved by heterologous regulatory sequences or by modifying, e.g. by 
mutation, the natural regulatory sequences present. It is also possible to combine the two 
advantageous methods together. 

In an advantageous embodiment of the process for preparing amino acids in transgenic 
organisms, the process comprises introducing in the abovementioned process step (a) a nucleic 
acid sequence which is selected from the group of nucleic acid sequences 

i) of a nucleic acid sequence having the sequence depicted in SEQ ID NO: 1 ; 

ii) of a nucleic acid sequence obtained owing to the degeneracy of the genetic code 
through back-translation of the amino acid sequence depicted in SEQ ID NO: 2, and 

iii) of a derivative of the nucleic acid sequence depicted in SEQ ID NO: 1 which codes for a 
polypeptide having at least 50% homology at the amino acid level with the amino acid 
sequence depicted in SEQ ID NO: 2, with a negligible reduction in the biological activity 
of the polypeptides; and 

subsequently expressing these nucleic acid sequences in a transgenic organism. 

This advantageous embodiment of the process for preparing amino acids in transgenic 
organisms thus appears as follows: 

a) introduction of a nucleic acid sequence selected from the group 

i) of a nucleic acid sequence having the sequence depicted in SEQ ID NO: 1 ; 

ii) of a nucleic acid sequence obtained owing to the degeneracy of the genetic code 
through back-translation of the amino acid sequence depicted in SEQ ID NO: 2, and 

iii) of a derivative of the nucleic acid sequence depicted in SEQ ID NO: 1 which codes 
for a polypeptide having at least 50% homology at the amino acid level with the 
amino acid sequence depicted in SEQ ID NO: 2, with a negligible reduction in the 
biological activity of the polypeptides; and 

b) expression of a nucleic acid sequence mentioned under (a) in a transgenic organism. 

In a further advantageous embodiment of the process for preparing amino acids in transgenic 
organisms, the process comprises introducing in the abovementioned process step (a) a nucleic 
acid sequence which is selected from the group of nucleic acid sequences 
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i) of a nucieic acid sequence obtained owing to the degeneracy of the genetic code through 
back-translation of the amino acid sequence depicted in SEQ ID NO: 3, SEQ ID NO: 4, 
SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID 

5 NO: 10; 

ii) of a derivative of the nucleic acid sequence which is obtained by back-translation of the 
amino acid sequence depicted in SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10 and which has at 
least 70% homology at the amino acid level with the aforementioned amino acid 

10 sequences, with a negligible reduction in the biological activity of the polypeptides; and 

subsequently expressing these nucleic acid sequences in a transgenic organism. 

After the introduction and expression of the nucleic acid sequences used in the processes of the 
invention, the transgenic organism is advantageously cultured and subsequently harvested. In 
the case where the transgenic organism is a microorganism such as a eukaryotic organism such 

15 as a fungus, an alga or a yeast or a prokaryotic organism such as a bacterium such as a 

bacterium of the genera Escherichia, Bacillus, Corynebacterium or Brevibacterium, the latter is 
cultured In a solid or liquid medium known to the skilled worker and usual for the particular 
organism. After culturing, the organisms are harvested. The amino acids can then be further 
processed directly in the human or animal food or for other applications, for example as 

20 disclosed in EP-B-0 533 039 or EP-A-0 615 693, which are incorporated herein by reference, or 
else further purified in a conventional way by extraction and precipitation or on an ion exchanger. 
Products of these various workups are amino acids or amino acid compositions still containing 
portions of the fermentation broth and of the cells in various amounts advantageously in the 
range from 0 to 100% by weight, preferably from 1 to 80% by weight, particularly preferably 

25 between 5 and 50% by weight, very particularly preferably between 5 and 40% by weight. 

In an advantageous embodiment of the invention, the organism is a plant whose amino acid 
content is advantageously modified by the introduced nucleic acid sequence. This is important 
for plant breeders because, for example, the nutritional value of plants for monogastric animals 
is limited by some essential amino acids such as lysine or methionine. This transgenic plant 

30 produced in this way is, after introduction of the nucleic acid, grown on or in a nutrient medium or 
else in soil culture and subsequently harvested. The plants can then be used directly as human 
or animal foods or else be further processed, it is also possible in this case to purify the amino 
acids further in a conventional way by extraction and precipitation or on an ion exchanger. 
Products of these various workups are amino acids or amino acid compositions still containing 

35 portions of the plant in various amounts advantageously in the range from 0 to 100% by weight, 



Wietanomics GmbH & Co. KGaA 



20020960 



PF 54195 



8 

preferably from 20 to 80% by weight, particularly preferably between 50 and 90% by weight, very 
particularly preferably between 80 and 99% by weight. The plants are advantageously used 
immediately without further workup. 

In a further embodiment of the invention, the organism is a microorganism such as bacteria of 
5 the genera Corynebacterium, Brevibacterium, Escherichia or Bacillus. These microorganisms 
are advantageously used in a fermentation process. 

Besides the sequence specified in SEQ ID NO: 1, the nucleic acid sequences which can be 
derived from the sequences SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ 
ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10, or derivatives thereof, it is 

10 advantageous also for other genes to be expressed and/or mutated in the organisms. It is 
particularly advantageous for at least one further gene of the L-lysine, L-threonine and/or L- 
methionine biosynthefic pathway additionally to be expressed in the organisms such as plants or 
microorganisms, and/or for genes whose regulation have been modified to be expressed. It is 
also possible and advantageous to have modified the regulation of the natural genes In such a 

1 5 way that the gene and/or its gene product is no longer subject to the control systems present in 
the organisms. This results in enhanced synthesis of the desired amino acids because, for 
example, feedback regulation is no longer present or is no longer present to the same extent. 

In a further embodiment of the process of the invention, therefore, organisms are grown, 
advantageously bacteria of the genera Corynebacterium, Brevibacterium, Bacillus or Escherichia 

20 or plants, in which there is simultaneous overexpression of at least one nucleic acid or one of 
the genes which code for proteins selected from the group of gene products consisting of 
aspartate kinase (lysC), of aspartate-semialdehyde dehydrogenase (asd), of glyceraldehyde- 
3~phosphate dehydrogenase (gap), of 3-phosphoglycerate kinase (pgk), of pyruvate 
carboxylase (pyc), of triosephosphate isomerase (tpi), of homoserine O-acetyltransferase 

25 (metA), of cystahionine y-synthase (metB), of cystahionine gamma-lyase (metC), cystahionine 
B-lyase, of methionine synthase (metH), of serine hydroxymethyliransferase (glyA), of 
O-acetylhomoserine sulfhydrylase (metY), of methylenetetrahydrofolate reductase (metF), of 
phosphoserine aminotransferase (serC), of phosphoserine phosphatase (serB), of serine 
acetyltransferase (cysE), of cysteine synthase (cysK), of homoserine dehydrogenase (horn) and 

30 S-adenosylmethionine synthase (metX). 

In a further advantageous embodiment of the process of the invention, the organisms used in 
the process are those in which simultaneously at least one of the aforementioned genes or one 
of the aforementioned nucleic acids is mutated so that the activity of the corresponding proteins 
is influenced by metabolites to a smaller extent compared with the unmutated proteins, or not at 
35 all, and that in particular the production according to the invention of the amino acids is not 
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impaired, or so that their specific enzymatic activity is increased. Less influence means in this 
connection that the regulation of the enzymic activity is less by at least 10%, advantageously at 
least 20, 30 or 40%, particularly advantageously by at least 50, 60 or 70%, compared with the 
starting organism, and thus the activity of the enzyme is increased by these figures mentioned 
5 compared with the starting organism. An increase in the enzymatic activity means an enzymatic 
activity which is increased by at least 10%, advantageously at least 20, 30 or 40%, particularly 
advantageously by at least 50, 60 or 70%, compared with the starting organism. This leads to an 
increased productivity of the desired amino acid or of the desired amino acids. 

In a further advantageous embodiment of the process of the invention, the organisms used in 
1 0 the process are those in which simultaneously at least one of the genes selected from 
homoserine kinase (thrB), threonine dehydratase (ilvA), threonine synthase (thrC), meso- 
diaminopimelate D-dehydrogenase (ddh), phosphoenol pyruvate carboxykinase (pek), glucose-6- 
phosphate 6-isomerase (pgi), pyruvate oxidase (poxB), dihydrodipicolinate synthase (dapA), 
dihydrodipicolinatc reductase (dapB) and diaminopicolinat© decarboxylase (lysA) is attenuated. 
1 5 in particular by reducing the rate of expression of the corresponding gene. 

In another embodiment of the process of the Invention, the urganisms used in the process are 
those in which simultaneously at least one of the aforementioned nucleic acids or of the 
aforementioned genes is mutated in such a way that the enzymatic activity of the corresponding 
protein Is partially reduced or completely blocked. A reduction in the enzymatic activity means an 
20 enzymatic activity which is reduced by at least 10%, advantageously at least 20, 30 or 40%, 
particularly advantageously by at least 50, 60 or 70%, compared with the starting organism. 

The activity of enzymes can be influenced in such a way that there is a reduction or Increase in 
the reaction rate, or a modification (reduction or increase) in the affinity for the substrate. 

Microorganisms of the genera Corynebacterium or Brevibacterium or plants are preferably 
25 employed in the process of the invention. 

It is also possible to prepare chemically pure amino acids or amino acid compositions by the 
processes described above. For this purpose, the amino acids or the amino acid compositions 
are isolated from the organism such as the microorganisms or the plants or the culture medium 
in which or on which the organisms have grown, or from the organism and the culture medium, 
30 in a known manner. These chemically pure amino acids or amino acid compositions are 
advantageous for applications in the food industry, the cosmetics industry or the drugs industry 
sectors. 

Amino acids such as methionine, lysine or mixtures thereof, preferably methionine, are 
advantageously prepared by the process of the invention. 
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it is moreover possible to increase the aforementioned amino acids in the process of the 
invention by at least a factor of 3, preferably by at least a factor of 5, particularly preferably by at 
least a factor of 10, very particularly preferably by at least a factor of 50, compared with the wild 
type of the organisms. 

5 It is possible in principle to increase by the process of the invention the amino acids prepared in 
the organisms used in the process in two ways. It is possible advantageously to increase the 
pool of free amino acids and/or the proportion of amino acids prepared by the process in the 
proteins. The process of the invention advantageously increases the pool of free amino acids in 
the transgenic organisme. In the advantageous case of fermentation of microorganisms, the 
10 amino acids are enriched in the medium. 

Suitable in principle for the process of the invention are al! eukaryotic or prokaryotic organisms 
able to synthesize methionine and/or lysine. The organisms used in the process are 
advantageously microorganisms such as bacteria, fungi, yeasts or algae or plants such as 
dicotyledonous or monocotyledonous plants such as plants of the Aceraceae, Anacardiaceae, 

15 Apiaceae, Asteraceae, Brassicaceae, Cactaceae, Cucurbitaceae, Euphorbiaceae, Fabaceae, 
Malvaceae, Nymphaeaceae, Papaveraceae, Rosaceae, Salicaceae, Solanaceae, Arecaceae, 
Bromeliaceae, Cyperaceae, Iridaceae, Liiiaceae, Orchidaceae, Gentianaceae, Labiaceae, 
Magnoliaceae, Ranunculaceae, Caprifolaneae. Rubiaceae, Scrophuiariaceae, Caryophyllaceae, 
Ericaceae, Polygonaceae, Violaceae, Juncaceae or Poaceae families, preferably a plant 

20 selected from the group of families Apiaceae, Asteraceae, Brassicaceae, Cucurbitaceae, 
Fabaceae, Papaveraceae, Rosaceae, Solanaceae, Liiiaceae or Poaceae. 

Advantageous in the process of the invention are transgenic microorganisms such as fungi such 
as the genus Claviceps or Aspergillus or Gram-positive bacteria such as the genera Bacillus, 
Corynebacterium, Micrococcus, Brevibacterium, Rhodococcus, Nocardia, Caseobacter or 

25 Arthrobacter or Gram-negative bacteria such as the genera Escherichia, Flavobacterlum or 
Salmonella or yeasts such as the genera Rhodotorula, Hansenula or Candida. Particularly 
advantageous organisms are selected from the group of genera Corynebacterium, 
Brevibacterium, Escherichia, Bacillus, Rhodotorula, Hansenula, Candida, Claviceps or 
Flavobacterium. It is very particularly advantageous to use in the process of the invention 

30 microorganisms selected from the group of genera and species consisting of Hansenula 

anomala, Candida utiiis, Claviceps purpurea, Bacillus circulans, Bacillus subtilis, Bacillus sp., 
Brevibacterium albidum, Brevibacieiium album, Drevibacterium ccrinum, Brevibacterium flavum, 
Brevibacterium glutamigenes, Brevibacterium iodinum, Brevibacterium ketoglutamicum, 
Brevibacterium lactofermentum, Brevibacterium linens, Brevibacterium roseum, Brevibacterium 

35 saccharolyticum, Brevibacterium sp., Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium ammoniagenes, Corynebacterium glutamicum 
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(= Micrococcus giutamicum), Corynebacterium melassecola, Corynebacterium sp. or 
Escherichia coli, specifically Escherichia coli K12 and its described strains. 

It is advantageous to use in the process of the invention transgenic plants selected from the 
group of useful plants. Such as plants selected from the group of peanut, oilseed rape, canola, 
5 sunflower, safflower, olive, sesame, hazelnut, almond, avocado, bay, pumpkin, flax, soybean, 
pistachio, borage, corn, wheat, rye, oats, millet, triticale, rice, barley, cassava, potato, sugar 
beet, feed beet, aubergine, and perennial grasses and feed crops, oil palm, vegetables 
(brassicas, roots, tubers, legumes, fruit vegetables, bulbs, leaf and stem vegetables), 
buckwheat, Jerusalem artichoke, hmari hean. vetches, lentil, dwarf bean, alfaalfa, lupin, clover 
10 and lucerne. 

The nucleic acid sequence(s) used in the process for preparing amino acids in transgenic 
organisms are advantageously derived from a eukaryote (the plural is intended to include the 
singular and vice versa for the invention), but may also be derived from a prokaryote. The 
nucleic acid sequences are advantageously derived from a plant such as a plant selected from 

15 the Aceraceae, Anacardiaceae, Apiaceae, Asteraceae, Brassicaceae, Cactaceae, 

Cucurbitaceae, Euphorbiaceae, habaceae, Malvaceae, Nymphaeaceae, Papaveiaceae, 
Rosaceae, Salicaceae, Solanaceae, Arecaceae, Bromeliaceae, Cyperaceae, Iridaceae, 
I iliaceae, Orchidaceae. Gentianaceae. Labiaceae, Magnoliaceae, Ranunculaceae, 
Carifolaceae, Rubiaceae, Scrophulariaceae, Caryophyllaceae, Ericaceae, Polygonaceae, 

20 Violaceae, Juncaceae or Poaceae families, preferably a plant selected from the group of 
families Apiaceae, Asteraceae, Brassicaceae, Cucurbitaceae, Fabaceae, Papaveraceae, 
Rosaceae, Solanaceae, Liliaceae or Poaceae, a fungus such as the genera Aspergillus, 
Penicillum or Claviceps or a yeast such as the genera Pichia, Torulopsis, Hansenula, 
Schizosaccharomyces, Candida, Rhodotorula or Saccharomyces. The sequences are 

25 particularly advantageously derived from yeasts such as the genera Pichia, Torulopsis, 

Hansenula, Schizosaccharomyces, Candida, Rhodotorula or Saccharomyces, very particularly 
advantageously from yeast of the Saccharomycetaceae family such as the advantageous genus 
Saccharomyces and the particularly advantageous genus and species Saccharomyces 
cerevisiae. 

30 The nucleic acid sequences used in the process of the invention and having the sequence 

SEQ ID NO: 1 code for a threonine aldolase. This aldolase shows the highest homology with the 
GLY1 protein from A. gossypii [Eremothocium ashbii, Eremothecium gossypii] (FMRI database 
accession No. AJ005442, CAA06545.1, identity at the amino acid level with SEQ ID NO: 1 of 
76%). Homologies with a large number of nucleic acids can additionally be found. The threonine 

35 aldolase from yeasts such as Candida albicans (EMBL database accession No. AF009967, 
AAB64198.1, identity at the amino acid level with SEQ ID NO: 1 of 56%), from 
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Schizosaccharomyces pombe (EMBL database accession No. Z99163, CAB1 6235.1, identity at 
the amino acid level with SEQ ID NO: 1 of 49%), or from bacteria such as Aeromonas jandaei 
(EMBL database accession No. AF 169478, AAD47837.1, identity at the amino acid level with 
SEQ ID NO: 1 of 41%), Pseudomonas aeruginosa (EMBL database accession No. AF011922, 
5 AAC46016.1, Identity at the amino acid level with SEQ ID NO: 1 of 38%), Vibrio cholerae (EMBL 
database accession No. AE004405, AAF96663.1, identity at the amino acid level with SEQ ID 
NO: 1 of 38%), Escherichia coli (EMBL database accession No. AB005050, BAA20882.1 , 
identity at the amino acid level with SEQ ID NO: 1 of 38%), Deinococcus radiodurans (EMBL 
database accession No. AE001 978, AAF1 0885.1 , identity at the amino acid level with SEQ ID 

10 NO: 1 of 38%), Bacillus halodurans (EMBL database accession No. AP00151B, BAB07002.1, 
identity at the amino acid level with SEQ ID NO: 1 of 34%), Halobacterium sp. (EMBL database 
accession No. AE005124, AAG20528.1), Thermotoga maritima (EMBL database accession 
No. AE001813, AAD36809.1 , identity at the amino acid level with SEQ ID NO: 1 of 40%) or the 
plants Arabidopsis thaliana (EMBL database accession No. AF325033, AAG40385.1 , 

15 AC022287, AAF63783.1 , AC003981, AAC14037.1, identity at the amino acid level with SEQ ID 
NO: 1 of in each case 40, 42 or 37%) or from nonhuman animals such as Caenorhabditis 
elegans (EMBL database accession No. 770309, CAA94358.1 . identity at the amino acid level 
with SEQ ID NO: 1 of 41%) or Drosophila melanogaster (EMBL database accession No. 
AE003744, AAF56152.1, identity at the amino acid level with SEQ ID NO: 1 of 39%) or the 

20 alanine racemase from fungi such as Cochliobolus carbonum/Bipolaris zeicola (EMBL database 
accession No. AF169478, AAD47837.1, identity at the amino acid level with SEQ ID NO: 1 of 
38%). It is advantageous to U3e in the process of the invention nucleic acid sequences and 
proteins encoded thereby which are derived from yeasts of the genera Candida, Hansenula, 
Rhodotorula, Schizosaccharomyces or Saccharomyces. The aldolase which is advantageously 

25 used in the process of the invention additionally shows high homology with the sequences 

specified in SEQ ID NO: 3 (identity at the amino acid level with SEQ ID NO: 1 of 35%), SEQ ID 
NO: 4 (identity at the amino acid level with SEQ ID NO. 1 of 35%), SEQ ID NO: 5 (identity at the 
amino acid level with SEQ ID NO: 1 of 27%), SEQ ID NO: 6 (identity at the amino acid level with 
SEQ ID NO: 1 of 43%), SEQ ID NO: 7 (identity at the amino acid level with SEQ ID NO: 1 of 

30 39%), SEQ ID NO: 8 (identity at the amino acid level with SEQ ID NO: 1 of 32%), SEQ ID NO: 9 
(identity at the amino acid level with SEQ ID NO: 1 of 35%) or SEQ ID NO: 10 (identity at the 
amino acid level with SEQ ID NO: 1 of 36%) and which are derived from soybean (SEQ ID NO: 
3 - 5), rice (SEQ ID NO: 6 and 7) and from canola (SEQ ID NO: 8 - 1 0). It is possible and 
advantageous to use in the process nucleic acid sequences derived from the amino acid 

35 sequences SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID 
NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10. 

Nucleic acid sequences which are advantageous for the process of the invention and which 
code for polypeptides having aldolase activity can be found in generally accessible databases. 
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Particular mention should be made in this connection of general gene databases such as the 
EMBL database (Stoesser G. et a!., Nucleic Acids Res 2001 , Vol. 29, 17-21), of the GenBank 
database (Benson DA et a!., Nucleic Acids Res 2000, Vol. 28,15-15), or the PIR database 
(Barker W..C. et al., Nucleic Acids Res. 1999, Vol. 27, 39-43). 

5 It is additionally possible to use organism-specific gene databases for finding advantageous 
sequences, e.g. advantageously for yeast the SGD database (Cherry J. M. et al., Nucleic Acids 
Res. 1998, Vol. 26, 73-80) or the MIPS database {Mewes H.W. et al., Nucleic Acids Res. 1999, 
Vol. 27, 44-48), for E. coli the GenProtEC database 

(http://web.bham.ac.uk/bcm4ght6/res.html), for Arabidopsis the TAIR database (Huala, E. et al., 
10 Nucleic Acids Res. 2001 Vol. 29(1), 102-5) or the MIPS database. 

In order to improve the introduction of the nucleic acid sequences and the expression of the 
sequences in the transgenic organisms used in the process, the nucleic acid sequences are 
inserted into a nucleic acid construct and/or a vector. In addition to the sequences described 
above and used in the process of the invention, further nucleic acid sequences, advantageously 
1 5 of biosynthesis genes of the amino acid prepared in the process, may be present in the nucleic 
acid construct or in the vector and are inserted together into the organism. These additional 
sequences may, however, also be inserted directly or via other separate nucleic acid constructs 
or vectors into the organisms. 

The nucleic acid sequences used in the process of the invention are isolated nucleic acid 
20 sequences coding for polypeptides having aldolase activity. 

Nucleic acids mean in the process of the invention DNA or RNA sequences which may be 
single- or double-stranded or may, where appropriate, have synthetic, unnatural or modified 
nucleotide bases which can be incorporated in DNA or RNA. 

The term "expression" means the transcription and/or translation of a codogenic gene segment 
25 or gene. The resulting product is usually a protein. However, the products also include functional 
RNAs such as, for example, ribozymes. Expression may take place systemically or locally, e.g. 
confined to particular cell types, tissues or organs. 

The expression products of the nucleic acids, e.g. of the codogenic gene segments (ORFs) and 
of their regulatory elements, can be characterized by their function. Included in this are, for 
30 example, functions in the areas of metabolism, energy, transcription, protein synthesis, protein 
processing, cellular transport and transport mechanisms, cellular communication and signal 
transduction, cell rescue, cell defense and eel! virulence, regulation of the cellular environment 
and interaction of the cell with its environment, cell fate, transposable elements, virai proteins 
and plasmid proteins, cellular organization monitoring, subcellular localization, regulation of 
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protein activity, proteins with binding function or cofactor requirement and transport facilitation. 
Genes of identical function are combined to so-called functional gene families. 

It is possible through the biological activity of the nucleic acids which are used in the process of 
the invention and which code for polypeptides having threonine aldolase activity for different 
5 amino acids to be prepared or the preparation thereof to be improved and/or increased. Mixtures 
of the various amino acids can be prepared, depending on the selection of the organism used 
for the process of the invention, for example a microorganism or a plant. 

Transgenic organisms in the process of the invention mean when plants are concerned also 
plant cells, tissues, organs such as root, shoot, stem, seed, flower, tuber or leaf or whole plants 
1 0 grown to prepare amino acids. Growing means, for example, culturing the transgenic plant cells, 
tissues or organs on or in the nutrient medium or the whole plant on or in a substrate, for 
example in hydroculture, flowerpot soil or on a field. 

If plants are chosen as donor organism In the process of the invention, it is possible in principle 
for this plant to have any phylogenetic relationship with the recipient plant. Thus, donor and 
1 5 recipient plants may belong to the same family, genus, species, variety or line, with the 

homology between the nucleic acids to be integrated and corresponding parts of the genome of 
the recipient plant increasing. The same also applies to microorganisms as donor and recipient 
organisms. 

It is advantageous to use in the process of the invention a nucleic acid sequence having the 
sequence depicted in SEQ ID NO: 1, nucleic acid sequences derived from amino acid 
sequences SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID 
NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10, or derivative thereof or homologs which code for 
polypeptides which still have the enzymatic activity or biological activity. These sequences are 
cloned singly or in combination into expression constructs. These expression constructs make 
optimal synthesis of the amino acids produced in the process of the invention possible. 

In a preferred embodiment, the process additionally includes the step of obtaining a cell which 
comprises the nucleic acid sequences which are used in the process and which code for an 
enzyme having threonine aldolase activity, where a cell is transformed with the nucleic acid 
sequences, with a gene construct (= nucleic acid construct) or with a vector, which bring about 
30 expression of the aldolase nucleic acid on its own or in combination with other genes or 

sequences. In a further preferred embodiment, this process also includes the step of obtaining 
the amino acid(s) or the amino acid mixture from the culture and/or the organism. The cell 
prepared in this way is advantageously a cell of a plant described as advantageous as above, or 
of a microorganism. 



20 
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Transgenic organism such as a plant or a transgenic microorganism means for the purposes of 
the invention that the nucleic acids used in the process are not at their natural site in the genome 
of an organism, and it is possible for the nucleic acids to be expressed homologously or 
heterologously. However, transgenic also means thai the nucleic acids of the invention are in 
5 their natural place in the genome of an organism but that the sequence has been modified 
compared with the natural sequence and/or that the regulatory sequences of the natural 
sequences have been modified. Transgenic preferably means expression of the nucleic acids 
used in the process of the invention at an unnatural site in the genome, i.e. there is homologous 
or, preferably, heterologous expression of the nucleic acids. Expression may moreover take 

10 place transiently or from a sequence stably integrated in the genome Preferred transgenic 
plants are, for example, the following plants selected from the Aceraceae, Anacardiaceae, 
Apiaceae, Asteraceae, Brassicaceae, Cactaceae, Cucurbitaceae, Euphorbiaceae, Fabaceae, 
Malvaceae, Nymphaeaceae, Papaveraceae, Rosaceae, Salicaceae, Solanaceae, Arecaceae, 
Bromeliaceae, Cyperaceae, lridaceae, Liliaceae, Orchidaceae, Gentianaceae, Labiaceae, 

15 Magnollaceae, Ranunculaceaa, Carifolaceae, Rubiaceac, Scrophulariaceae, Caryophyliaceae, 
Ericaceae, Polygonaceae, Violaceae, Juncaceae or Poaceae families, preferably a plant 
selected from the group of families Apiaceae, Asteraceae, Brassicaceae, Cucurbitaceae, 
Fabaceae, Papaveraceae, Rosaceae, Solanaceae, Liliaceae or Poaceae. Further advantageous 
preferred plants are useful plants advantageously selected from the group of the genus of 

20 peanut, oilseed rape, canola, sunflower, safflower, olive, sesame, hazelnut, almond, avocado, 
bay, pumpkin, flax, soybean, pistachio, borage, corn, wheat, rye, oats, millet, triticale, rice, 
barley, cassava, potato, sugar beet, aubergine, alfaalfa and perennial grasses and feed crops, 
oil palm, vegetables (brassicas, roots, tubers, legumes, fruit vegetables, bulbs, leaf and stem 
vegetables), buckwheat, Jerusalem artichoke, broad bean, vetches, lentil, dwarf bean, lupin, 

25 clover and lucerne. 

The term "transgenic plant" used according to the invention also refers to the progeny of a 
transgenic plant, e.g. the T r , T 2 -, T 3 - and subsequent plant generations or the BC r , BC 2 -, BC 3 - 
and subsequent plant generations. Thus, the transgenic plants of the invention can be grown 
and crossed with themselves or other individuals in order to attain further transgenic plants of 

30 the invention. Transgenic plants can also be obtained by vegetative propagation of transgenic 
plant cells Thfi present invention also relates to transgenic plant material which can be derived 
from a population according to the invention of transgenic plants. This includes plant cells and 
certain tissues, organs and parts of plants in ail their manifestations, such as seeds, leaves, 
anthers, fibers, roots, root hairs, stems, embryos, calli, cotyledons, petioles, harvest material, 

35 plant tissue, reproductive tissue and ceii cultures, which is derived from the actual transgenic 
plant and/or can be used to produce the transgenic plant. 
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Transgenic plants containing the amino acids synthesized in the process of the invention can be 
marketed directly without isolating the synthesized compounds. Plants mean in the process of 
the invention all plant parts, plant organs such as leaf, stalk, root, tubers or seeds or the whole 
plant. The seed includes in this connection ail seed parts such as the seed cases, epidermal and 
5 seed cells, endosperm or embryo tissue. The amino acids prepared in the process of the 

invention may, however, also be isolated from the plants in the form of their free amino acids or 
bound in proteins. Amino acids prepared by this process can be harvested by harvesting the 
organisms either from the culture in which they are growing, or from the field. This can take 
place by pressing, grinding and/or extraction, salt precipitation and/or ion exchange 
10 chromatography ot the plant parts, preferably of the plant seeds, fruit, tubers, etc. 

It is possible in this way to isolate more than 50% by weight, advantageously more than 60% by 
weight, preferably more than 70% by weight, particularly preferably more than 80% by weight, 
very particularly preferably more than 90% by weight, of the amino acids prepared in the 
process. The amino acids obtained in this way can then be further purified where appropriate, 
15 mixed if desired with other active ingredients such as vitamins, amino acids, carbohydrates, 
antibiotics, etc, and formulated where appropriate. 

A further embodiment according to the invention is the use of the amino acids prepared in the 
process or of the transgenic organisms in animal or human foods, cosmetics or 
pharmaceuticals. 

20 The nucleic acids used in the process can be integrated after introduction into a plant cell or 
plant either in the plastid genome or, preferably, in the genome of the host ceil, and transient 
expression is possible and can be used advantageously. Production through, for example, viral 
infection with recombinant virus is also possible in principle, and in this case the expression of 
the gene or genes is advantageously increased. On integration into the genome, the integration 

25 may be random or take place via recombination such that the native gene is replaced by the 
introduced copy, thus modulating production of the desired compound by the ceil, or by use of a 
gene in trans so that the gene is functionally connected to a functional expression unit which 
comprises at least one sequence ensuring expression of a gene and at least one sequence 
ensuring potyadenylation of a functionally transcribed gene. The nucleic acids are 

30 advantageously put into the plants via multiexpression cassettes or constructs for multiparallel 
expression of genes. In a further advantageous embodiment, the nucleic acid sequence is 
introduced in a simple expression cassette or a simple construct, i.e. without other different 
nucleic acid sequences, into the plant. Heterologous nucleic acid sequences are preferably 
introduced. 
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it is possible by using cloning vectors in plants and in the plant transformation such as those 
published and cited in: Plant Molecular Biology and Biotechnology (CRC Press, Boca Raton, 
Florida), Chapter 6/7, pages 71-1 19 (1993); F.F. White, Vectors tor Gene Transfer in 
Higher Plants; in: Transgenic Plants, Vol. 1, Engineering and Utilization, editors: Kung and 
5 R. Wu, Academic Press, 1993, 15-38; B Jenes et aL. Techniques for Gene Transfer, in: 
Transgenic Plants, Vol. 1, Engineering and Utilization, editors: Kung and R. Wu, Academic 
Press (1993), 128-143; Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991), 
205-225 to use the nucleic acids for genetic manipulation of a wide range of plants so that the 
latter becomes a better or more efficient producer of the amino acids prepared in the process of 
1 0 the invention. This improved production or efficiency of production of the amino acids or 

products derived therefrom, such as modified proteins, can be brought about by a direct effect of 
the manipulation or an indirect effect of this manipulation. 

There is a number of mechanisms by which the modification of the threonine aldolase protein 
used in the process of the invention can directly influence the yield, production and/or efficiency 

1 5 of production of the amino acids from one the transgenic plants or the microorganisms such as 
a yeast, a fungus or a bacterium on the basis of a modified protein. The number or activity of the 
threonine aldolase protein or gene can be increased so that this enzymic activity results In larger 
amounts of the desired product being prepared de novo because the organisms for example 
bckod the introduced enzymatic activity and thus the ability to increase the biosynthesis before 

20 introduction of the corresponding gene. However, expression of the gene naturally present in the 
organisms can also be increased, for example through a modified regulation of the gene, or the 
stability of the mRNA or of the gene product, i.e. of the aldolase, can be increased. 
Corresponding statements apply to the combination with other enzymes useful for synthesizing 
the amino acids from the biosynthesis metabolism. The use of various divergent sequences, i.e. 

25 ones which are different at the DNA sequence level, may also be advantageous in this 

connection, or the use of promoters for the gene expression which makes gene expression at a 
different time possible. 

It is possible by introducing a threonine aldolase gene or a plurality of aldolase genes into an 
organism alone or in combination with other genes not only to increase the biosynthetic flux to 

30 the final product but also to increase, alter or create de novo a product composition present in 
the organism. It is likewise possible to increase the number or activity of other genes in the 
import or export of nutrients of the cell(s) which are necessary for biosynthesis of the amino 
acids, SO that the concentration of these precursors, cofactors or intermediates within the cell(s) 
or within the storage compartment is increased, thus further increasing the ability of the cells to 

35 produce amino acids, as described below. The yield, production and/or efficiency of production 
of amino acids in the host organism, such as the plants or the microorganisms, can be 
increased by optimizing the activity or increasing the number of threonine aldolase nucleic acid 
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sequences and/or further genes involved in the biosynthesis of the amino acids, or by destroying 
the activity of one or more genes involved in the degradation of the amino acids. 

Through this influencing of metabolism it is possible in the process of the invention to prepare 
further advantageous sulfur-containing compounds which comprise at least one covalently 
5 bonded sulfur atom. Examples of such compounds are besides methionine, homocysteine, 
S-adenosylmethionine, cysteine, advantageously methionine and S-adenosylmethionine. 

The terms "L-methionine", "methionine", "homocysteine" and "S-adenosylmethionine" also 
include for the purposes of the present invention the corresponding salts such as, for example, 
methionine hydrochloride or methionine sulfate. The terms methionine or threonine are also 
1 0 intended to include the terms L-methionine or L-threonine. 

The isolated nucleic acid molecules used in the process of the invention code for proteins or 
parts thereof, where the proteins or the individual protein or parts thereof comprises an amino 
acid sequence which is sufficiently homologous With an amino auid sequence of the sequence 
SEQ ID NO: 2 that the protein or the part thereof retains a threonine aldolase activity. The 

1 5 . protein or the part thereof which is encoded by the nucleic acid molecule preferably has its 
essential enzymatic or biological activity and the ability to take part in the metabolism of amino 
acids in plants or microorganisms and generally in plant or microorganism metabolism or in the 
transport of molecules across membranes. The protein encoded by the nucleic acid molecules Is 
advantageously at least about 30%, 35%, 40%, 45% or 50%, preferably at least about 60% and 

20 more preferably at least about 70%, 80% or 90% and most preferably at least about 95%, 96%, 
97%, 98%, 99% or more homologous with an amino acid sequence of the sequence SEQ ID 
NO: 2. The protein is preferably a full-length protein which is substantially in parts homologous 
with a complete amino acid sequence of the SEQ ID NO: 2 (which is derived from the open 
reading frame shown in SEQ ID NO: 1). Further advantageous further nucleic acid sequences 

25 used in the process of the invention are derived from tho amino acid sequences SEQ ID NO: 3, 
SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or 
SEQ ID NO: 10. The proteins encoded by these derived nucleic acid molecules are 
advantageously at least about 70% or 75%, preferably about at least 80% or 85%, more 
preferably at least about 90%, 91 %, 92% or 94% and most preferably at least about 95%, 96%, 

30 97%, 98%, 99% or more homologous with the amino acid sequences encoded by them or with 
an amino acid sequence of the sequence SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO: 6, SFQ ID NO- 7, SEO ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10. Homology or 
homologous means for the purposes of the invention identity or identical. 

Essential enzymatic or biuluyical activity of the enzymes used means that, compared with the 
35 proteins/enzymes encoded by the sequences having SEQ ID NO: 1 or the sequences derived 
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from the sequences SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5 T SEQ ID NO: 6, SEQ ID NO: 
7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10, and derivatives thereof, they still have at 
least an enzymatic or biological activity of at least 10%, preferably 20%, particularly preferably 
30% and very especially 40% and are thus able to take part in the metabolism of amino acids in 
5 a plant or microorganism cell necessary compounds or in the transport of molecules across 
membranes, where the amino acids methionine or lysine are advantageously meant. 

Nucleic acids which can be used in the process are advantageously derived from yeasts such as 
of the Saccharomycetaceae family such as the advantageous genus Saccharomyces or yeast 
genera such as Candida, Hansenula, Rhodntnnila nr Schizosaccharomyces and the particularly 
10 advantageous genus and species Saccharomyces cerevisiae. Its sequence is deposited under 
the EMBL accession numbers U18779, L10830 and U00092 in the EMBL database as product 
Glylp (protein required for glycine prototrophy), CDS complement (14603.. 15766) with the 
protein ID ="AAB64996.1" and db_xref-'GI: 603634". 

An alternative possibility is to use in the process of the invention isolated nucleotide sequences 
15 which code for putative aldolases and which hybridize onto a nucleotide sequence of SEQ ID 
NO: 1 or, in another advantageous embodiment, onto a sequence derived from the sequences 
SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ID NO: 9 or SEQ ID NO: 10, e.g hybridize under stringent conditions. The hybridization 
should advantageously be carried out with fragments of a length of at least 200 bp, 
20 advantageously at least 400 bp, preferably at least 600 bp, particularly preferably of at least 800 
bp, very particularly preferably of at least 1000 bp. In a particularly preferred embodiment, the 
hybridization should be carried out with the complete nucleic acid sequence. 

The nucleic acid sequences used in the process are advantageously introduced in an 
expression cassette (= nucleic acid construct) which makes expression of the nucleic acids 
25 possible in an organism, advantageously a piant or a microorganism. 

For the introduction, the codogenic gene segment is advantageously subjected to an 
amplification and ligation in a known manner. The procedure is preferably based on the Pfu DNA 
polymerase protocol or a Pfu/Taq DNA polymerase mixture protocol. The primers are chosen on 
the basis of the sequence to be amplified. The primers should expediently be chosen so that the 

30 amplicon includes the complete codogenic sequence from start codon to stop codon. Following 
the amplification, the amplicon is expediently analyzed. The analysis can take place for example 
with regard to quality and quantity after fractionation by gel electrophoresis. The amplicon can 
then be purified in accordance with a standard protocol (e.g. Qiagen). An aliquot of the purified 
amplicon is then available for subsequent cloning. Suitable cloning vectors are generally known 

35 to the skilled worker. These include in particular vectors which are able to replicate in bacterial 
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systems, i.e. especially vectors which ensure efficient cioning in E. coli, and which make stable 
transformation of plants possible. Mention should be made in particular of various binary and 
cointegrated vector systems suitable for T-DNA-mediated transformation. Vector systems of this 
type are usually characterized by comprising at least the vir genes necessary for agrobacterium- 
5 mediated transformation, and the T-DNA border sequcncco. These vector systems preferably 
also comprise further cis-regulatory regions such as promoters and terminators and/or selection 
markers with which appropriately transformed organisms can be identified. Whereas vir genes 
and T-DNA sequences are arranged on the same vector in cointegrated vector systems, binary 
systems are based on at least two vectors, one of which harbors a vir gene but no T-DNA, and a 

10 second harbors I -UNA but no vir gene. This makes the lattei vectors relatively small, easy to 
manipulate and easy to replicate both in E. coli and in Agrobacterium. These binary vectors 
include vectors of the pBIB-HYG, pPZP, pBecks, pGreen series. Preferably used according to 
the invention are Bin19, pBI101, pBinAR, pGPTV and pCAMBIA. A review of binary vectors and 
their use is given by Hellens et aL Trends in Plant Science (2000} 5, 446-451 . For vector 

15 preparation, the vectors can be initially linearized with restriction endonuclease(s) and then 
enzymatically modified in a suitable way. The vector is subsequently purified, and an aliquot is 
employed for the cloning In the r.lnning, the enzymatically cut and, if necessary, purified 
amplicon is cloned with similarly prepared vector fragments using ligase. It is moreover possible 
for a particular nucleic acid construct or vector or plasmid construct to have one or else more 

20 than one codogenic gene segments. The codogenic gene segments in these constructs are 
preferably functionally linked to regulatory sequences. The regulatory sequences include in 
particular plant aequences such ao tho promoters and terminators described above. The 
constructs are advantageously capable of stable propagation in microorganisms, especially 
Escherichia coli and Agrobacterium tumefaciens, under selective conditions, and make transfer 

25 of heterologous DNA possible into plants or other microorganisms. In a particular embodiment, 
the constructs are based on binary vectors (review of binary vectors in Hellens et al„ 2000). The 
latter usually comprise prokaryotic regulaioiy sequences such as origin of replication and 
selection markers for replication in microorganisms such as Escherichia coli and Agrobacterium 
tumefaciens, and agrobacterium T-DNA sequences for the purpose of transferring DNA into 

30 plant genomes. Of the complete Agrobacterium T-DNA sequence, at least the right border 
sequence comprising about 25 base pairs is required. The vector constructs of the invention 
usually comprise T-DNA sequences both from the right and from the left border region, which 
expediently comprise recognition sites for enzymes which act site-specifically and which in turn 
are encoded by part of the vir genes Suitable host organisms are known to the skilled worker. 

35 Advantageous organisms are described above in this application. These include in particular 
bacterial hosts, of which some have already been mentioned above in connection with donor 
microorganisms, e.g. microorganisms such as fungi such as the genus Claviceps or Aspergillus 
or Gram-positive bacteria such as the genera Bacillus, Corynebacterium, Micrococcus, 
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Brevibacterium, Rhodococcus, Nocardia, Caseobacter or Arthrobacter or Gram-negative 
bacteria such as the genera Escherichia, Flavobacterium or Salmonella or yeasts such as the 
genera Rhodotoruia, Hansenula or Candida, Particularly advantageous organisms are selected 
from the group of genera Corynebacterium, Brevibacterium, Escherichia, Bacillus, Rhodotoruia, 
5 Hansenula, Candida, Claviceps or Flavobacterium. It is very particularly advantageous to use in 
the process of the invention microorganisms selected from the group of genera and species 
consisting of Hansenula anomala, Candida utilis, Claviceps purpurea, Bacillus circulans, Bacillus 
subtilis, Bacillus sp., Brevibacterium albidum, Brevibacterium album, Brevibacterium cerinum, 
Brevibacterium flavum, Brevibacterium glutamigenes, Brevibacterium iodinum, Brevibacterium 

10 ketoglutamicum, Brevibacterium laciofermentum, Rrevihar.terium linens, Brevibacterium 
roseum, Brevibacterium saccharolyticum, Brevibacterium sp., Corynebacterium 
acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium ammoniagenes, 
Corynebacterium glutamicum {= Micrococcus glutamicum), Corynebacterium melassecola, 
Corynebacterium sp. or Escherichia coli, specifically Escherichia coli K12 and its described 

1 5 strains. Advantageously preferred according to the invention are host organisms of the genus 
Escherichia, in particular Escherichia coli, and Agrobacterium, in particular Agrobacterium 
tumefaciens, or plants selected from the Aceraceae, Anacardiaceae, Apiaceae, Asteraceae, 
Brassicaceae, Cactaceae, Cucurbitaceae, Euphorbiaceae, Fabaceae, Malvaceae, 
Nymphaeaceae, Papaveraceae, Rosaceae, Salicaceae, Solanaceae, Arecaceae, Bromeliaceae, 

20 Cyperaceae, Irldaceae, Uliaceae, Orohidaceae, Gentianaceae, Labiaceac, Magnoliaceao, 
Ranunculaceae, Carifolaceae, Rubiaceae, Scrophulariaceae, Caryophyilaceae, Ericaceae, 
Polygonaceae, Violaceae, Juncaceae or Poaceae families, preferably a plant selected from the 
group of families Apiaceae, Asteraceae, Brassicaceae, Cucurbitaceae, Fabaceae, 
Papaveraceae, Rosaceae, Solanaceae, Liliaceae or Poaceae. Further advantageous preferred 

25 plants are usefui plants advantageously selected from the group of the genus of peanut, oilseed 
rape, canola, sunflower, safflower, olive, sesame, hazelnut, almond, avocado, bay, pumpkin, 
flax, soybean, pistachio, borage, corn, wheat, rye, oats, millet, triticale, rice, barley, cassava, 
potato, sugar beet, feed beet, aubergine and perennial grasses and feed crops, oil palm, 
vegetables (brassicas, roots, tubers, legumes, fruit vegetables, bulbs, leaf and stem 

30 vegetables), buckwheat, Jerusalem artichoke, broad bean, vetches, lentil, alfaaifa, dwarf bean, 
lupin, clover and lucerne. For introducing the nucleic acids used in the process of the invention 
into a plant it has proved to be advantageous initially to transfer them into an intermediate host, 
e.g. a bacterium. Transformation into E. coli has proved expedient in this connection and can be 
carried out in a manner known per se, e.g. by heat shock or electroporation. Thus, the 

35 transformed E. coli colonies can be investigated for the cloning efficiency. This can take place 
with the aid of a PCR. It is moreover possible to examine both the identity and the integrity of the 
plasm id construct on the basis of a defined number of colonies by subjecting an aliquot of the 
colonies to said PCR. The primers employed for this purpose are generally universal primers 
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derived from vector sequences, with the forward primer being disposed upstream of the "start 
ATG and the reverse primer being disposed downstream of the stop codon of the codogenic 
gene segment. The ampiicons are fractionated by electrophoresis and assessed for quantity and 
quality. Detection of a fragment of the appropriate size leads to a positive assessment. The 
5 plasmid constructs which are examined where appropriate are subsequently used for 

transforming the plants. It may for this purpose initially be necessary to obtain the constructs 
from the intermediate host. The constructs can, for example, be obtained as plasm ids from 
bacteria! hosts on the basis of a conventional plasmid isolation. Numerous processes for 
transforming plants are known. Since stable integration of heterologous DNA into the genome of 

10 plants is advantageous according to the invention, T DNA-mediated transformation has proved 
to be particularly expedient. It is for this purpose initially necessary to transform suitable 
vehicles, especially agrobacteria, with the codogenic gene segment or the corresponding 
plasmid construct. This can take place in a manner known per se. For example, the plasmid 
construct produced in accordance with the above statements can be transformed by means of 

15 electroporation or neat shock into competent agrobacteria. A diatinctipn must be mad© in thie 
connection in principle between the formation of cointegrated vectors on the one hand and 
transformation with binary vectors. In the first alternative, the vector constructs including the 
codogenic gene segment have no T-DNA sequences; on the contrary, the formation of the 
cointegrated vectors takes place in the agrobacteria through homologous recombination of the 

20 vector construct with T-DNA. The T-DNA is present In the agrobacteria in Hits form of Ti or Ri 
plasmids in which the oncogenes have expediently been replaced by exogenous DNA. On use 
of binary vectors it is possible to transfer them by bacterial conjugation or direct transfer to 
agrobacteria. These agrobacteria expediently already contain the vector which harbors the vir 
genes (frequently referred to as helper Ti{Ri) piasmid). Together with the plasmid construct and 

25 T-DNA it is expediently possible also to use one or more markers using which it is possible to 
select transformed agrobacteria and transformed plant-cells. A large number of markers has 
been developed for this purpose. These include, for example, those conferring resistance to 
chloramphenicol, kanamycin, the aminoglycoside G418, hygromycin and the like. It is usually 
desired for the plasmid constructs to be flanked on one or both sides of the codogenic gene 

30 segment by T-DNA. This is particularly useful when bacteria of the gene species Agrobacterium 
tumefaciens or Agrobacterium rhizogenes are used for the transformation. A method preferred 
according to the invention is transformation using Agrobacterium tumefaciens. However, biolistic 
methods can also be used advantageously for inserting the sequences in the process of the 
invention, and insertion using PEG is also possible. The transformed agrobacteria can be 

35 cultured in a manner known per se and are thus available for expedient transformation of the 
plants. The plants or plant parts to be transformed are grown or provided in a conventional way. 
The plants or plant parts are then exposed to the transformed agrobacteria until an adequate 
transformation rate is reached. The plants and plant parts can be exposed to agrobacteria in 
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various ways. For example, a culture of morphogenic plant cells or tissues can be used. 
Following the T-DNA transfer, the bacteria are usually eliminated by antibiotics, and the 
regeneration of plant tissue is induced. Suitable plant hormones are used in particular for this 
purpose in order, after initial callus formation, to promote the formation of shoots. An 
5 advantageous transformation method is in planta transformation. For this purpose, it is possible 
to expose plant seeds for example to the agrobacteria, or to inoculate plant meristem with 
agrobacteria. It has proved particularly expedient according to the invention to expose the whole 
plant or at least the flower primordia to a suspension of transformed agrobacteria. The former is 
then grown further until seeds of the treated plant are obtained (Clough and Bent, Plant J. (1998) 

10 1G, 735-743). To select transformed plants, the plant material obtained from thft transformation 
is usually subjected to selective conditions so that transformed plants can be distinguished from 
untransformed plants. For example, the seeds obtained in the manner described above can be 
sown anew and, after growing, subjected to a suitable spray selection. A further possibility is to 
grow the seeds, if necessary after sterilization, on agar plates using a suitable selecting agent in 

15 such a way that only the transformed seeds are able to grow to plants. Further advantageous 
transformation methods in particular of piants are known to the skilled worker and are described 
below. 

The nucleic acid sequences coding for the threonine aldolase used in the process of the 
invention are functionally linked to one or more regulatory signals, advantageously for increasing 

20 gene expression, in the process of the invention. These regulatory sequences are intended to 
make specific expression of the genes and protein expression possible. This may mean, for 
example, depending on the host organism (= transgenic organism, e.g. plant or microorganism), 
that the gene is expressed and/or overexpressed only after induction, or that it is immediately 
expressed and/or overexpressed. Examples of these regulatory sequences are sequences to 

25 which inducers or repressors bind and thus regulate the expression of the nucleic acid. In 

addition to these new regulatory sequences or in place of these sequences it is possible for the 
natural regulation of these sequences still to be present in front of the actual structural genes 
and, where appropriate, to have been genetically modified so that the natural regulation has 
been switched off and the expression of the genes has been increased. The expression 

30 cassette (= expression construct = gene construct = nucleic acid construct) may, however, also 
have a simpler structure, i.e. no additional regulatory signals have been inserted in front of the 
nucleic acid sequence or its derivatives, and the natural promoter with its regulation has not 
been deleted. Instead, the natural regulatory sequence has been mutated so that regulation no 
longer takes place and/or gene expression is increased. These modified promoters can also be 

35 put in the form of partial sequences (= promoter with parts of the nucleic acid sequences of the 
invention) alone in front of the natural gone to increase the activity. The gene construct may 
additionally advantageously also comprise one or more so-called "enhancer sequences" 
functionally linked to the promoter, which make increased expression of the nucleic acid 
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sequence possible. Additional advantageous sequences can also be inserted at the 3' end of the 
DNA sequences, such as further regulatory elements or terminators. The nucleic acid 
sequence(s) coding for the threonine aldolase proteins may be present in one or more copies in 
the expression cassette (= nucleic acid construct). It is advantageous for only one copy in each 
5 case ur the genes to be present in the expression cassette. This nucleic acid construct or the 
nucleic acid constructs may be expressed together in the host organism. It is moreover possible 
for the nucleic acid construct or the nucleic acid constructs to be inserted in one or more vectors 
and be present free in the cell, or else be inserted in the genome, advantageously. In the case of 
plants, integration into the plastid genome or, preferably, into the celf genome can take place. It 
1 0 is advantageous for insertion of further genes ii i the host genome if the genes to be expressed 
are present together in one gene construct. 

Regulatory sequences are usually disposed upstream (5'), within and/or downstream (3') in 
relation to a particular nucleic acid or a particular codogenic gene segment. They control in 
particular the transcription and/or translation, and the transcript stahility of the codogenic gene 
1 5 segment, where appropriate in cooperation with further functional systems intrinsic to the cell, 
such as the protein biosynthesis apparatus of the cell. 

Regulatory sequences include in particular sequences disposed upstream (5'), which relate in 
particular to regulation of transcription initiation, such as promoters, and sequences disposed 
downstream (3'), which relate in particular to regulation of transcription termination, such as 
20 polyadenylation signals. 

Promoters which can be employed are in principle all those able to stimulate transcription of 
genes in organisms such as microorganisms, plants or nonhuman animals. Suitable promoters 
able to function in these organisms are generally known. They may be constitutive or inducible 
promoters. Suitable promoters may in multicellular eukaryotes make development- and/or 
25 tissue-specific expression possible, and it Is thus possible in plants advantageously to use leaf , 
root-, flower-, seed-, guard cell- or fruit-specific promoters. 

The regulatory sequences or factors may moreover, as described above, preferably have a 
positive influence, and thus increase, gene expression of the introduced genes. Thus, the 
regulatory elements can advantageously be strengthened at the level of transcription by using 
30 strong transcription signals such as promoters and/or enhancers. Besides this, however, it is 
also possible to enhance translation by, for example, introducing translation enhancer 
sequences or improving the stability of the mRNA. 

One or more nucleic acid constructs comprising one or more nucleic acid sequences which are 
defined by SEQ ID NO: 1 and code for the polypeptides represented in SEQ ID NO: 2 are a 
35 further embodiment of the invention. One or more nucleic acid constructs comprising one or 
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more nucleic acid sequences which can be derived from the sequences of the invention SEQ ID 
NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID 
NO: 9 or SEQ ID NO: 10 are a further advantageous embodiment of the invention. Said 
polypeptides advantageously have threonine aldolase activity. The same applies to their 
5 homologs, derivatives or analogs which are functionally connected to one nr more regulatory 
signals, advantageously to increase gene expression. 

Advantageous regulatory sequences for the novel process are present for example in promoters 
such as the cos, tac, rha, trp, tet, trp-tet, Ipp, lac, Ipp-lac, lacl q \ T7, T5, T3, gal, trc, ara, SP6, 
A-P R nr A-P L promoter, which are advantageously used in Gram-negative bacteria. Further 

10 advantageous regulatory sequences are present for example in the Gram-positive promoters 
amy, dnaK, xylS and SP02, in the yeast or fungus promoters ADC1 , MFa, AC, P-60, UASH, 
MCB, PHO, CYC1 , GAPDH, TEF, rp28, ADH or in the plant promoters CaMV/35S [Franck et al., 
Cell 21 (1980) 285-294, US 5,352,605], PRP1 [Ward et al., Plant. Mol. Biol. 22 (1993)], SSU, 
PGEL1, OCS [Lcioncr and Gelvin (1088) Proc Natl Acad Sci USA 85(5):2553-2557], Iib4, usp, 

15 mas [Comai et al. (1990) Plant Mol Biol 15 (3):373-381], STLS1 , ScBV Schenk et al. (1999) 

Plant Mol Biol 39(6):1221-1230, B33, SAD1 or SAD2 {flax promoters, Jain et al., Crop Science, 
39 (6), 1999; 1696 - 1701) or nos [Shaw et al. (1984) Nucleic Acids Res. 12(20):7831-7846]. It is 
also possible and advantageous to use the various ubiquitin promoters from Arabidopsis [Callis 
et al.(1990) J. Biol. Chem., 205:12406- 12493; Holtorf S ot al. (1995) Plant. Mol. Biol., 29:637- 

20 747], Pinus, corn [(Ubi1 und Ubi2), US 5,510,474; US 6,020,190 and US 6,054574] or parsley 
[Kawalleck et al., Plant Molecular Biology, 21, 1993: 673 - 684] or phaseolin promoter. Likewise 
advantageous in this connection are inducible promoters such as the promoters described in 
EP-A-0 388 186 (benzylsulfonamide-inducible), Piant J. 2, 1992:397^04 (Gatz et al., 
tetracycline-inducible), EP-A-0 335 528 (abcisic acid-inducible) or WO 93/21334 (ethano!- or 

25 cyclohexenol-inducible). Further suitable plant promoters are the promoter of cytosolic FBPase 
or the potato ST-LSI promoter (Stockhaus et al.. EMBO J. 8, 1989, 2445), Glycine max 
phosphoribosyl-pyrophosphate amidotransferase promoter (Genbank access No. U87999) or 
the node-specific promoter described in EP-A-0 249 676. Particularly advantageous promoters 
are promoters which make expression possible in specific tissues or show a preferential 

30 expression in certain tissues. Also advantageous are seed-specific promoters such as the USP 
promoter of the embodiment, but also other promoters such as the LeB4, DC3, SAD1 . phaseolin 
or napin promoter. Further particularly advantageous promoters are seed-specific promoters 
which can be used for monocotyledonous or dicotyledonous plants and are described in 
US 5,608,152 (oilseed rape napin promoter), WO 98/45461 (Arobidopsis oleosin promoter), US 

35 5,504,200 (Phaseolus vulgaris phaseolin promoter ), WO 91/13980 (brassica Bce4 promoter), 
and by Baeumleln et al., Plant J., 2, 2, 1992:233-239 (legume LeB4 promoter), these promoters 
being suitable for dicotyledons. The following promoters are suitable for example for 
monocotyledons barley lpt-2 or lpt-1 promoter (WO 95/15389 and WO 95/23230), barley 
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hordein promoter, the corn ubiquitin promoter and other suitable promoters described in WO 
99/16890. 

It is possible in principle to use all natural promoters with their regulatory sequences, such as the 
abovementioned, for the novel process. It is likewise possible and advantageous to use 
5 synthetic promoters additionally or aione, especially if they mediate seed-specific expression as 
described, for example, in WO 99/16890. 

In order to achieve a particularly effective content of threonine aldolase proteins in transgenic 
plants, the encoded biosynthesis genes can advantageously be expressed constitutively and/or 
seed-, fruit- or tuber-specifically in plants. In a further advantageous embodiment, however, they 

10 may also be inducibly expressed, so that they are induced, and thus expressed, specifically in a 
desired growth phase of the plant. It is possible to use for this purpose seed-specific promoters 
or promoters which are active in the embryo and/or in the endosperm. Seed-specific promoters 
can in principle be isolated both from dicotyledonous and from monocotyledonous plants. 
Advantageous preferred promoters are listed in the following: (JSP {= unknown seed protein) 

15 and vicilin (Vicia faba) [Baumlein et al., Mol. Gen Genet., 1991, 225(3)], napin (oilseed rape) 
[US 5,61)8,152], acyl carrier protein (oilseed rape) [US 5,310,001 and WO 92/16634], olcooin 
(Arabidopsis thaliana) [WO 98/45461 and WO 93/20216], phaseolin (Phaseolus vulgaris) 
[US 5.504,200], Bce4 [WO 91/13980], legume B4 (LegB4 promoter) [Baumlein et al., Plant J., 
2,2, 1992], Lpt2 and Ipt1 (barley) [WO 95/15389 and WO95/23230], seed-specific promoters 

20 from rice, com and wheat [WO 99/16890], Amy32b, Amy 6-6 and aleurain [US 5,677,474], Bce4 
(oilseed rape) [US 5,530,149], glycinin (soybean) [EP 5/1 741], phospnoenolpyruvate 
carboxylase (soybean) [JP 06/62870], ADR12-2 (soybean) [WO 98/08962], isocitrate lyase 
(oilseed rape) [US 5,689,040] or B-amylase (barley) [EP 781 849]. 

Plant gene expression can also be facilitated by a chemically inducible promoter (see a review in 
25 Gatz 1997, Annu. Rev. Plant Physiol. Plant Moi. Biol., 46:89-106). Chemically inducible 

promoters are particularly suitable when it is desired for gene expression to take place in a time- 
specific manner. Examples of such promoters are a salicylic acid-inducibie promoter 
(WO 95/19443), tetracycline-inducible promoter (Gatz et al. (1992) Plant J. 2, 397-404) and 
ethanol-inducible promoter. 

30 Expression specifically in gymnosperms or angiosperms is also possible in principle. 

In order to ensure stable integration of nucleic acid sequences used in the process of the 
invention in combination with further biosynthesis genes in the transgenic plant over several 
generation, each of the nucleic acids which are used in the process and code for the aldolases 
should be expressed under the control of its own, preferably of a different, promoter, because 
35 repeating sequence motifs may lead to instability of the T-DNA or to recombination events or to 
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silencing. The structure of the expression cassette is advantageously such that a promoter is 
followed by a suitable cleavage site for inserting the nucleic acid to be expressed, 
advantageously in a polyllnker subsequently where appropriate a terminator is located behind 
the polylinker. This successive arrangement is repeated a plurality of times, preferably three, 
5 four or five times, so that up to five genes can be combined in a construct and thus be 
introduced for expression into the transgenic plant. The successive arrangement is 
advantageously repeated up to three times. The nucleic acid sequences are inserted for 
expression via the suitable cleavage site, for example in the polyiinker behind the promoter. It is 
advantageous for each nucieic acid sequence to have its own promoter and, where appropriate, 

1 0 its own terminator However, it is also possible for a plurality of nucieic acid sequences to be 
inserted behind a promoter and, where appropriate, in front of a terminator. The insertion site or 
the successive arrangement of the inserted nucleic acids in the expression cassette is not of 
crucial importance, which means that a nucleic acid sequence can be inserted in first or last 
place in the cassette with the expression being negligibly influenced thereby. It is possible and 

1 5 advantageous to use in the expression cassette different promoters such as, for example, the 
USP, the LegB4, the DC3 promoter or the ubiquitin promoter from parsley and different 
terminators, it is, however, also possible to use only one type of promoter in the cassette. This 
may, however, lead to unwanted recombination events or silencing effects. A further 
advantageous nucleic acid sequence which can be expressed in combination with the 

20 sequences used in the process and/or the aforementioned biosynthesis genes is the sequence 
for an ATP/ADP translator as described in WO 01/20009. This ATP/ADP translocator leads to 
an increase in the synthesis of the essential amino acids lysine and/or methionine. 

As described above, the transcription of the introduced genes should advantageously be 
stopped by suitable terminators at the 3' end of the introduced biosynthesis genes (behind the 
25 stop codon). It is possible to use for this purpose, for example, the OCS1 terminator. Just as for 
the promoters, different terminator sequences should be used for each gene here. 

The gene construct may, as described above, also include other genes which are to be 
introduced into the organisms. It is possible and advantageous for regulatory genes, such as 
genes for inducers, repressors or enzymes, which intervene through their enzymic activity in the 

30 regulation of one or more genes of a biosynthetic pathway, to be introduced into the host 

organisms and to be expressed therein. These genes may be of heterologous or homologous 
origin. The nucleic acid construct or gene construct may also advantageously contain further 
biosynthcois genee, or else these genes may be located nn another or a plurality of other nucleic 
acid constructs. Biosynthesis genes advantageously used are genes of amino acid metabolism, 

35 of glycolysis, of tricarboxylic acid metabolism or combinations thereof. 
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It is moreover possible for the aforementioned polypeptides or enzymes to be cloned in 
combination with further genes in the nucleic acid constructs or vectors and be employed for 
transforming microorganisms or plants with the aid of, for example, Agroboctcrium. 

The regulatory sequences or factors may moreover, as described above, preferably have a 
5 positive influence, and thus increase, gene expression of the introduced genes. Thus, the 
regulatory elements can advantageously be strengthened at the level of transcription by using 
strong transcription signals such as promoters and/or enhancers. Besides this, however, it is 
also possible to enhance translation by, for example, introducing translation enhancer 
sequences or improving the stability of the mRNA. The expression cassettes can in principle be 
1 0 used directly for introduction into the plant, or else be introduced into a vector. 

These advantageous vectors, preferably expression vectors, comprise the nucleic acid which 
are used in the process and which code for threonine aldolase proteins, or a nucleic acid 
construct which comprises the nucleic acid used, alone or in combination with further genes 
such as the biosynthesis genes of amino acid metabolism. I he term "vector", as used herein, 

1 5 relates to a nucleic acid molecule which is able to transport another nucleic acid to which it is 
linked. One type of vectnr is a "piasmid" which stands for a circular double-stranded DNA loop 
into which additional DNA segments can be ligated. A further type of vector is a viral vector, in 
which case additional DNA segments can be ligated into the viral genome. Certain vectors are 
capable of autonomous replication in a host cell into which they have been introduced (e.g. 

20 bacterial vectors with bacterial origin of replication). Other preferred vectors are advantageously 
integrated on introduction into the host cell info the genome of a host cell and thus replicated 
together with the host genome. In addition, certain vectors are able to control the expression of 
genes to which they are functionally connected. These vectors are referred to here as 
"expression vectors". As mentioned above, they are capable of autonomous replication or may 

25 be integrated into the host genome. Expression vectors suitable for DNA recombination 
techniques are usually in the form of plasmids. "Piasmid" and "vector" can. be used 
exchangeably in the present description because the piasmid is the most commonly used vector 
form. However, the invention is intended to encompass these other expression vector forms 
such as viral vectors, which exercise similar functions. The term vector is also intended to 

30 encompass other vectors known to the skilled worker, such as phages, viruses such as SV40, 
CMV, TMV, transposons, IS elements, phasmids, phagemids, cosmids, linear or circular DNA. 

The recombinant expression vectors advantageously used in the process include the nucleic 
acids of the invention or the nucleic acid construct of the invention in a form which are suitable 
for expression of the nucleic acids used in a host cell, meaning that the recombinant expression 
35 vectors includes one or more regulatory sequences selected on II ie basis of the host cells to be 
used for the expression, which is functionally connected to the nucleic acid sequence to be 
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expressed. In a recombinant expression vector, "functionally connected" means that the 
nucleotide sequence of interest is iinked to the regulatory sequence(s) in such a way that 
expression of the nucleotide sequence is possible and they are linked to one another so that 
both sequences comply with the predicted function ascribed to the sequence (e.g. in an in vitro 
5 transcription/translation system or in a host cell whan thQ vector is introduced into the host cell). 
The term "regulatory sequence" is intended to include promoters, enhancers and other 
expression control elements (e.g. polyadenylation signals). These regulatory sequences are 
described, for example, in Goeddel: Gene Expression Technology: Methods in Enzymology 1 85, 
Academic Press, San Diego, CA (1990), or see: Gruber and Crosby, in: Methods in Plant 

10 Molecular Biology and Biotechnolgy, CRC Press, Boca Raton, Florida, editors: Glick and 

Thompson, Chapter 7, 89-108, including the references therein. Regulatory sequences include 
those which control constitutive expression of a nucleotide sequence in many types of host ceil, 
and those which control direct expression of the nucleotide sequence only in particular host cells 
under particular conditions. The skilled worker is aware that the design of the expression vector 

15 may depend on factors such as the choice of host cell to be transfomied, the extent of 
expression of the desired protein etc. 

The recombinant expression vectors used may be designed specifically for the expression of 
nucleic acid sequences used in the process in prokaryotic or eukaryotic cells. This is 
advantageous because intermediate steps of vector construction are often carried out for 

20 simplicity in microorganisms. For example, the amino acid genes and/or threonine aldolase 
genes can be expressed in bacterial cells, insect ceils (using baculovirus expression vectors), 
yeast and other fungus celis [see Romanos, MA, et al. (1992) "Foreign gene expression in 
yeast: a review", Yeast 8:423-488; van den Hondel, C.A.M.J.J., et al. (1991) "Heterologous gene 
expression in filamentous fungi", in: More Gene Manipulations in Fungi, J.W. Bennet & 

25 L.L. Lasure, editors, pp. 396-428: Academic Press: San Diego; and van den Hondel, C.A.M.J.J., 
& Punt. P.J. (1991) "Gene transfer systems and vector development for filamentous fungi, in: 
Applied Molecular Genetics of Fungi, Peberdy, J.F., et al., editor, pp. 1-28, Cambridge University 
Press: Cambridge], algae [Falciatore et al., 1999, Marine Biotechnology. 1, 3:239-251] with 
vectors in a transformation process as described in WO 98/01572, and preferably in ceils of 

30 multicellular plants [see Schmidt, R. and Willmitzer, L. (1988) "High efficiency Agrobacterium 

fumefec/ens-mediatcd transformation of Arabidopsis thalisna leaf anri cotyledon explants" Plant 
Cell Rep.:583-586; Plant Molecular Biology and Biotechnology, C Press, Boca Raton, Florida, 
Chapter 6/7, pp. 71-119 (1993); F.F. White, B. Jenes et al., Techniques for Gene Transfer, in: 
Transgenic Plants, Vol. 1, Engineering and Utilization, editors: Kung and R. Wu, Academic 

35 Press (1993), 128-43; Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991), 205-225 
(and references cited therein)]. Suitable host ceils are also discussed in Goeddel, Gcno 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1 990). 
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The sequence of the recombinant expression vector may alternatively be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes usually takes place with vectors containing constitutive or 
inducible promoters which control the expression of fusion or nonfusion proteins. Typical fusion 
5 expression vectors are, inter alia, pGEX {Pharmacia Biotech Inc; Smith, D.B., and Johnson, K.S. 
(1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, 
Piscataway, NJ), in which glutathione S-transferase (GST), maltose E-binding protein and 
protein A, respectively, are fused to the recombinant target protein. 

Examples of suitable inducible nonfusion E. coli expression vectors are, inter alia, pTrc (Amarin 
10 et al. (1988) Gene 69:301-315) and pET 11d [Studier et al., Gene Expression Technology: 

Methods in Enzymology 185, Academic Press, San Diego, California (1990) 60-89], Target gene 
expression from the pTrc vector is based on transcription by host RNA polymerase from a hybrid 
trp-lac fusion promoter. Target gene expression from the pET 1 1d vector is based on 
transcription from a T7-gn1 0-lac fusion promoter which is mediated by a coexpressed viral KNA 
15 polymerase (T7 gn1). This viral polymerase is provided by the host strains BL21 (DE3) or 

HMS174 (DE3) by a resident □ prophage which harbors a T7 gn1 gene under the transcriptional 
control of the lacUV 5 promoter. 

Other vectors suitable in prokaryotic organisms are known to the skilled worker, these vectors 
being, for example, in E. coli pLG338, pACYC184, the pBR series, such as pBR322, the pUC 
20 series such as pUC1 8 or pUC19. the M1 13mp series, pKC30, pRep4, pHS1 , pHS2, pPLc236, 
pMBL24, pLG200, pUR290, plN-lll 113 -B1 , Xgt1 1 orpBdCI, in Streptomyces plJ101, plJ364, 
plJ702 or plJ361. in Bacillus pUB110, pC194 or pBD214, in Corynebacterium pSA77 or pAJ667. 

In a further embodiment, the expression vector is a yeast expression vector. Examples of 
vectors for expression in the yeast S. cerevisiae include pYe desaturase d (Baldari et al. (1 987) 

25 Embo J. 6:229-234), pMFa (Kurjan and Herskowitz (1982) Cell 30:933-943), pJRY88 (Schultz et 
al. (1987) Gene 54:1 13-123) and pYES2 (Invitrogen Corporation, San Diego, CA). Vectors and 
processes for constructing vectors suitable for use in other fungi such as the filamentous fungi 
include those described in detail in: van den Hondel, C.A.M.J.J., & Punt, P.J. [(1991) "Gene 
transfer systems and vector development fnr filamentous fungi, in: Applied Molecular Genetics 

30 of fungi, J.F. Peberdy et al., editors, pp. 1-28, Cambridge University Press: Cambridge; or in: 
More Gene Manipulations in Fungi; J.W. Bennet & L.L. Lasure, editors, pp. 396-428: Academic 
Press: San Diego]. Further suitable yeast vectors are, for example, 2-M, pAG-1, YEp6, YEp13 
or pEMBLYe23. 

Further vectors which may be mentioned by way of example are pALS1 , plL2 or pBB1 16 in fungi 
35 or pLGV23, pGHIac + , pBIN19, pAK2004 or pDH51 in plants. 
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An alternative possibility is to express the nucleic acid sequences in insect cells using 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in 
cultured insect cells (e.g. St9 cells) include the pAc series (Smith el al. (1983) Mol. Cell Biol.. 
3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

5 The abovementioned vectors provide only a small review of possible suitable vectors. Further 
plasmids are known to the skilled worker and are described for example in: Cloning Vectors 
(editors Pouweis, P.H., et al., Elsevier, Amsterdam-New York-Oxford, 1985, ISBN 0 444 
904018). For further suitable expression systems for prokaryotic and eukaryotic ceils, see 
Chapters 16 and 17 of Sambrook, J., Fritsch, E.F., and Maniatis, T., Molecular Cloning: A 
10 Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989. 

In a further advantageous embodiment of the process, the nucleic acid sequences can be 
expressed in unicellular plant cells (such as algae), see Falciatore et a!., 1999, Marine 
Biotechnology 1 (3):239-251 and references cited therein, and plant cells from higher plants (e.g. 

15 sperm atophyies such as crops). Examples of plant expression vectors include those described 
in detail in: Becker, D., Kemper, E„ Schell, J., and Masterson, R. [(1992) "New plant binary 
vectors with selectable markers located proximal to the left border", Plant Mol. Biol. 
20:1 195-1 197] and Bevan, M.W. [(1984) "Binary Agrobacterium vectors for plant 
transformation", Nucl. Acids Res. 12:871 1-8721; Vectors for Gene Transfer in Higher Plants; in: 

20 Transgenic Plants, Vol. 1 , Engineering and Utilization, editors: Kung and R. Wu, Academic 
Press, 1993, pp. 15-38]. A review of binary vectors and thoir use is also to be found in 
Hellens, R., Mullineaux, P. and Klee H., [(2000) " A guide to Agrobacterium binary vectors, 
Trends in Plant Science, Vol. 5 No. 10, 446-151. 

A plant expression cassette preferably comprises regulatory sequences able to control gene 
25 expression in plant cells and functionally connected so that each sequence is able to comply 
with its function, such as termination and transcription, for example polyadenylation signals. 
p re f errer j polyadenylation signals are those derived from Agrobacterium tumefaciens T-DNA, 
such as the gene 3 known as octopine synthase of the Ti plasmid pTiACH5 (Gielen et al., EMBO 
J. 3 (1984) 835ff.) or functional equivalents thereof, but al! other terminators functionally active in 
30 plants are also suitable. 

Since plant gene expression is very often not restricted at the levels of transcription, a plant 
expression cassette preferably comprises other functionally connected sequences such as 
translation enhancers, for example the overdrive sequence which comprises the 5'-untranslated 
leader sequence from tobacco mosaic virus which increases the protein/RNA ratio (Gallie et al., 
35 1987, Nucl. Acids Research 15:8693-8711). 
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For expression in plants, the nucleic acid sequences must, as described above, be functionally 
connected to a suitable promoter which carries out gene expression in a timely, cell- or tissue- 
specific manner. Promoters which can be used are uunstitutive promoters (Benfey et a!., EMBO 
J. 8 (1989) 2195-2202), such as those derived from plant viruses such as 35S CAMV (Franck et 
S at.. Cell 21 (1980) 285-294), 19S CaMV (see also US 5352605 and WO 84/02913), 34S FMV 
(Sanger et al., Plant. Mol. Biol., 14, 1990: 433 - 443), the parsley ubiquitin promoter or plant 
promoters such as that of the rubisco small subunit described in US 4,962,028. 

Other preferred sequences for use for functional connection in plant gene expression cassettes 
are targeting sequences which are necessary for guiding the gene product into its appropriate 
10 cell compartment (see a review in Kermode, Crit. Rev. Plant Sci. 15, 4 (1996) 285-423 and 
references cited therein), for example into the vacuoles, the cell nucleus, all types of plastids 
such as amyloplasts, chloroplasts, chromoplasts, the extracellular space, the mitochondria, the 
endoplasmic reticulum, elaioplast, peroxisomes and other compartments of plant cells. 

Plant gene expression can also be facilitated as described above by a chemically inducible 
15 promoter (see a review in Gatz 1997, Annu. Rev. Plant Physiol. Plant Mol. Biol., 48:89-108). 
Chemically inducible promoters are particularly suitable when time-specific gene expression is 
desired. Examples of such promoters are a salicylic acid-inducible promoter (WO 95/1 9443), a 
tetracycline-inducible promoter (Gatz et al. (1992) Plant J. 2, 397-404) and an ethanol-inducible 
promoter. 

20 Promoters which respond to biotic or abiotic stress conditions are also suitable promoters, for 
example the pathogen-induced PRP1 gene promoter (Ward et al., Plant. Mol. Biol. 22 (1993) 
361-366), the heat-inducible tomato hspBO promoter (US 5,187,267), the cold-inducible potato 
alpha-amylase promoter (WO 96/12814) or the pinll promoter which is inducible by wounding 
(EP-A-0 375 091). 

25 Particularly preferred promoters are those which bring about gene expression in tissues and 
organs in which amino acid biosynthesis takes place, in seed cells such as the cells of the 
endosperm and of the developing embryo. Suitable promoters are the oilseed rape napin gene 
promoter (US 5,608,152), the Vicia faba USP promoter (Baeumlein et al., Mol Gen Genet, 1991, 
??fi (3)-459-67), the Arabidopsis oleosin promoter (WO 98/45461), the Phaseoius vulgaris 

30 phaseolin promoter (US 5,504,200), the brassica Bce4 promoter (WO 91/13980), the bean arc5 
promoter, the carrot DcG3 promoter or the legumin B4 promoter (LeB4; Baeumlein et al., 1992, 
Plant Journal, 2 (2):233-9) and promoters which bring about seed-specific expression in 
monocotyledonous plants such as corn, barley, wheat, rye, rice etc. Advantageous seed-specific 
promoters are the sucrose binding protein promoter (WO 00/2638ft), the phaseolin promoter 

35 and the napin promoter. Suitable promoters worthy of note are the barley Ipt2 or lpt1 gene 
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promoter (WO 95/1 5389 and WO 95/23230) or those described in WO 99/1 6890 (promoters 
from the barley hordein gene, the rice glutelin gene, the rice oryzin gene, the rice prolamin gene, 
the wheat gliadin gene, wheat gluteiin gene, the corn zein gene, the oats glutelin gene, the 
sorghum kasirin gene, the rye secalin gene). 

5 In particular, multtparallel expression of the nucleic acids used in the process may be desired, 
alone or in combination with other genes or nucleic acids. Such expression cassettes can be 
introduced via simultaneous transformation of a plurality of individual expression constructs or, 
preferably, by combining a plurality of expression cassettes on one construct. It is also possible 
for a plurality of vectors to be transformed each with a plurality of expression cassettes and be 
1 0 transferred to the host cell. 

Promoters which bring about plastid-specific expression are likewise particularly suitable. 
Suitable promoters such as the virai RNA polymerase promoter are described in WO 95/16783 
and WO 97/06250, and the Arabidopsis dpP promoter is described in WO 99/46394. 

For strong expression of heterologous sequences in as many tissues as possible, especially 
15 including leaves, besides various of the abovementioned viral and bacterial promoters, 
preferably plant promoters of actin or ubiquitin genes such as, for example, the rice actinl 
promoter are used. The sugar beet V-ATPase promoters (WO 01/14572) represent a further 
example of constitutive plant promoters. Examples which should be mentioned uf synthetic 
constitutive promoters are the super promoter (WO 95/14098) and promoters derived from G 
20 boxes (WO 94/12015) A further possibility in some circumstances is also to utilize chemically 
inducible promoters, compare EP-A 388186, EP-A 335528, WO 97/06268. Also available for 
expression of genes in plants are leaf-specific promoters as described in DE-A, 19644478, or 
photoregulated promoters such as, for example, the pea petE promoter. 

Of the polyadenyiation signals, particular mention should be made of the Poly-A addition 
25 sequence from the ocs gene or nos gene of Agrobacterium tumefaciens. Further regulatory 
sequences which are expedient where appropriate also include sequences which control the 
transport and/or the localization of the expression products (targeting). In this connection, 
mention should be made particularly of the signal peptide- or transit peptide-encoding 
sequences known per se. For example, if is possible with the aid of plastid transit peptide- 
30 encoding sequences to guide the expression product into the plastids of a plant cell. Plants 
particularly preferred as recipient plants are, as described above, those which can be 
transformed in an expedient manner. These include mono- and dicotyledonous plants. Particular 
mention should be made of agricultural crop plants such as cereals and grasses, e.g. Triticum 
spp., Zea mays, Hordeum vulgare, I lafer, Secale cerealc, Oryza sativa, Penntsetum glaucum, 
35 Sorghum bicolor, Triticaie, Agrostis spp., Cenchrus ciliaris, Dactylis glomerata, Festuca 
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arundinacea, Loiium spp., Medicago spp. and Saccharum spp., legumes and oilseed crops, e.g. 
Brassica juncea, Brassica napus, Glycine max, Arachis hypogaea, Gossypium hirsutum, Cicer 
arietinum, Helianthus annuus, Lens culinaris, Linum usitatissimum, Sinapis alba, Trifolium 
repens and Vicia narbonensis, vegetables and fruits, e.g. bananas, grapes, Lycopersicon 
5 esculentum, asparagus, cabbage, water melons, kiwis. Solanum tuberosum, Beta vulgaris, 
cassava and chicory, trees, e.g. Coffea species, Citrus spp., Eucalyptus spp., Picea spp., Pinus 
spp. and Populus spp., medicinal plants and trees, and flowers. In a particular embodiment, the 
present invention relates to transgenic plants of the genus Arabidopsis, e.g. Arabidopsis thaliana 
and of the genus Oryza. 

1 0 Vector DNA can be introduced into prokaryotic or eukaryotic cells by conventional transformation 
or transfection techniques. The terms "transformation" and "transfection", conjugation and 
transduction, as used herein, are intended to include a large number of processes known in the 
art for introducing foreign nucleic acid (e.g. DNA) into a host cell, including calcium phosphate or 
calcium chloride coprecipitation. DEAE-dextran-mediated transfection, PEG-mediated 

15 transfection, lipofection, natural competence, chemically mediated transfer, electroporation or 
particle bombardment. Processes suitable for the transformation or transfection of host cells, 
including plant cells, are to be found in Sambrook et al. (Molecular Cloning: A Laboratory 
Manual, 2nd edition, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) and other laboratory handbooks such as Methods in Molecular 

20 Bioiogy, 1995, Vol. 44, Agrobacterium protocols, editors: Gartland and Davey, Humana Press, 
Totowa, New Jersey. 

The term "nucleic acid (molecule or sequence)", as used herein, may additionally include the 
untranslated sequence located at the 3' end and at the 5' end of the coding gene region: at least 
500, preferably 200, particularly preferably 100, nucleotides of the sequence upstream of the 5' 

25 end of the coding region and at least 100, preferably 50, particularly preferably 20, nucleotides of 
the sequence downstream of the 3' end of the coding gene region. It is advantageous to take 
only the coding region for cloning and expression. An "isolated" nucleic acid molecule is 
separated from other nucleic acid molecules present in the natural source of the nucleic acid. An 
"isolated" nucleic acid preferably has no sequences which naturally flank the nucleic acid in the 

30 genomic DNA of the organism from which the nucleic acid is derived (e.g. sequences located at 
the 5' and 3' ends of the nucleic acid). In various embodiments, the isolated nucleic acid 
molecule used in the process of the invention may comprise for example fewer than about 5 kb, 
4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb ur 0.1 kb of nucleotide sequences which naturally flank the nucleic 
acid molecule in the genomic DNA of the cell from which the nucleic acid is derived. 

35 The nucleic acid molecules used in the process, e.g. a nucleic acid molecule having a nucleotide 
sequence of SEQ ID NO:1 or of a part thereof, can be isolated by use of standard techniques of 
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molecular biology and the sequence information provided herein. It is also possible with the aid 
of comparison algorithms to identify for example a homologous sequence or homologous, 
conserved sequence regions at the DNA or amino acid level. These can be used as 
hybridization probe and standard hybridization techniques (as described, for example, in 
5 Sambrook et al.. Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) for isolating 
further nucleic acid sequences useful in the process. Moreover, a nucleic acid molecule 
comprising a complete sequence of SEQ !D NO: 1 or a part thereof can be isolated by 
polymerase chain reaction using oligonucleotide primers based on this sequence or parts 

1 0 thereof (e.g. a nucleic acid molecule comprising the complete sequence or a part thereof can be 
isolated by polymerase chain reaction using oligonucleotide primers constructed on the basis of 
this same sequence). For example, mRNA can be isolated from cells (e.g. by the guanidinium 
thiocyanate extraction process of Chirgwin et al. (1979) Biochemistry 18:5294-5299) and cDNA 
can be prepared using reverse transcriptase (e.g. Moloney MLV reverse transcriptase obtainable 

15 from Cibco/BRL, Bcthesda, MD, or AMV reverse transcriptase obtainable from Seikagaku 

America, Inc., St. Petersburg, FL). Synthetic oligonucleotide primers for amplification using the 
polymerase chain reaction can be designed on the basts of one of the amino acid sequences 
depicted in SEQ ID NO: 1 or with the aid of the amino acid sequences depicted in SEQ ID NO: 
2. A further possibility is to identify, by protein sequence comparisons of threonine aldolases 

20 from various organisms, conserved regions from which in turn degenerate primers can then be 
derived, These degenerate primers can then be utilized for amplifying fragments of new 
threonine aldolases from other organisms by PCR. These fragments can then be utilized as 
hybridization probe for isolating the complete gene sequence. An alternative possibility is to 
isolate the missing 5' and 3' sequences by means of RACE-PCR. A nucleic acid of the invention 

25 can be amplified using cDNA or, alternatively, genomic DNA as template and suitable 

oligonucleotide primers in standard PCR amplification techniques. The nucleic acid amplified in 
this way can be cloned into a suitable vector and characterized by DNA sequence analysis. 
Oligonucleotides corresponding to a nucleotide sequence used in the process can be prepared 
by standard synthetic processes, for example using an automatic DNA synthesizer. 

30 Nucleic acid molecules advantageous for the process of the invention can be isolated on the 
basis of their homology with the nucleic acids disclosed herein, using the sequences or a part 
thereof as hybridization probe in standard hybridization techniques under stringent hybridization 
conditions. In these cases it is possible for example to use isolated nucleic acid molecules which 
are at least 15 nucleotides long and hybridize under stringent conditions with trie nucleic acid 

35 molecules comprising a nucleotide sequence of SEQ ID NO: 1. Nucleic acids of at least 25, 50, 
100, 250 or more nucleotides nan also be used. The term "hybridizes under stringent 
conditions", as used herein, is intended to describe hybridization and washing conditions under 
which nucleotide sequences which are at least 60% homologous with one another usually 
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remain hybridized together. The conditions are preferably such that sequences which are at 
least about 65%, more preferably at least about 70% and even more preferably at least about 
75% or more homologous with one another usually remain hybridized together. Homolog or 
homology mean for the purposes of the invention identical or identity. These stringent conditions 
5 are known to the skilled worker and can be found in Current Protocols in Molecular Biology, 
John Wiley & Sons, N. Y. (1989), 6.3.1-6.3.6. A preferred, non-restrictive example of stringent 
hybridization conditions are hybridizations in 6 x sodium chloride/sodium citrate (= SSC) at about 
45°C, followed by one or more washing steps in 0.2 x SSC, 0.1% SDS at 50 to 65°C. The skilled 
worker is aware that these hybridization conditions differ according to the type of nucieic acid 

10 and, if for example organic solvents are present, with regard to the temperature and 

concentration of the buffer. The temperature differs for example under "standard hybridization 
conditions" depending on the type of nucleic acid between 42°C and 58°C in aqueous buffer with 
a concentration of from 0.1 to 5 x SSC (pH 7.2). If organic solvent is present in the 
abovementioned buffer, for example 50% formamide, the temperature under standard 

15 conditions is about 42°C. The hybridization conditions for DNA:DNA hybrids are preferably for 
example 0.1 x SSC and 20°C to 45°C, preferably between 30°C and 45°C. The hybridization 
conditions for DNA:RNA hybrids are preferably for example 0.1 x SSC and 30°C to 55°C, 
preferably between 45°C and 55°C. The aforementioned hybridization temperatures are 
intended for example for a nucleic acid with a length of about 100 bp {= base pairs) and a G + C 

20 content of 50% In the absence of formamide. The skilled worker is aware of how the necessary 
hybridization conditions can be determined from textbooks such as the aforementioned or from . 
the following textbooks Sambrook et a!., "Molecular Cloning", Cold Spring Harbor Laboratory, 
1989; Hames and Higgins (editors) 1985, "Nucleic Acids Hybridization: A Practical Approach", 
IRL Press at Oxford University Press, Oxford; Brown (editors) 1991 , "Essential Molecular 

25 Biology: A Practical Approach", IRL Press at Oxford University Press, Oxford. 

To determine the percentage homology (= identity) of two amino acid sequences (e.g. of SEQ ID 
NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID 
NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10) or of two nucleic acids (e.g. of sequence 
SEQ ID NO: 1), the sequences are aligned for optimal comparison purposes (e.g. gaps can be 

30 introduced in the sequence of one protein or nucleic acid to produce optimal alignment with the 
other protein or other nucleic acid). The amino acid residues or nucleotides at the corresponding 
amino acid positions or nucleotide positions are then compared. When a position in one 
sequence is occupied by the same amino acid residue or the same nucleotide as the 
corresponding position in the other sequence, then the molecules are homologous at this 

35 position (i.e. as used herein amino acid or nucleic acid "homology" is equivalent to amino acid or 
nucleic acid "identity"). The percentage homology hetween the two sequences is a function of 
the number of identical positions shared by the sequences (i.e. % homology = number of 
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identical positions/total number of positions x 100). The terms homology and identity are thus to 
be regarded as synonymous. 

An isolated nucleic acid molecule coding for a threonine aldolase homologous to a protein 
sequence of SEQ ID NO: 2 or the sequences SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
5 SEQ !D NO: 6, SEQ ID NO: 7, SEQ !D NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10 can be 
generated by introducing one or more nucleotide substitutions, additions or deletions into a 
nucleotide sequence of SEQ ID NO: 1 or into the nucleic acid sequences derived from the 
aforementioned amino acid sequences so that one or more amino acid substitutions, additions 
or deletions are introduced into the encoded protein. Mutations can be introduced into one of the 

10 sequences of SEQ ID NO: 1 by standard techniques, such as site-specific mutagenesis and 

PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are produced at 
one or more of the predicted nonessential amino acid residues. A "conservative amino acid 
substitution" is one in which the amino acid residue is replaced by an amino acid residue having 
a similar side chain. Families of amino acid residues having similar side chains have been 

1 5 defined in the art. These families include amino acids having basic side chains (e.g. lysine, 

arginlne, histidine), acidic side chains (e.g. aspartic acid, glutamic acid), uncharged polar side 
chains (e.g. glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolarside 
chains (e.g. alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), 
beta-branched side chains (e.g. threonine, valine, isoleucine) and aromatic side chains (e.g. 

20 tyrosine, phenylalanine, tryptophan, histidine). A predicted nonessential amino acid residue in a 
protein sequence such as SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO. 5, SEQ ID 
NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10 is thus preferably 
replaced by another amino acid residue from the same side-chain family. Alternatively, in 
another embodiment, the mutations can be introduced randomly along all or part of the coding 

25 sequence, e.g. by saturation mutagenesis, and the resulting mutants can be screened for their 
biological activity, i.e. amino acid production, in order to identify mutants which retain the 
biological activity or have increased it. After mutagenesis of one of the sequences of SEQ ID 
NO: 1 or of the nucleic acid sequence which can be derived from the aforementioned 
sequences, the encoded protein can be expressed recombinantly, and the activity of the protein 

30 can be determined for example using the assays described herein. 

Homologs of the nucleic acid sequences used with the sequence SEQ ID NO: 1 or the nucleic 
acid sequences derived from the sequences SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ 
ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 1 0 mean, for example, 
allelic variants having at least about 30 to 50%, preferably at least about 50 to 70%, more 
35 preferably at least about 70 to 80%, 80 to 90% or 90 to 95% and even more preferably at least 
about 95%, 96%, 97%, 98%, 99% or more homology with a nucleotide sequences shown in 
SEQ !D NO: 1 or the aforementioned derived nucleic acid sequences or their homologs. 
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analogs or parts thereof. In addition, isolated nucleic acid molecules of a nucleotide sequence 
which hybridize onto one of the nucleotide sequences shown in SEQ ID NO: 1, the derived 
nucleic acid sequences or a part thereof, e.g. under stringent conditions, are hybridised. Allelic 
variants include in particular functional variants which can be obtained by deletion, insertion or 
5 substitution of nucleotides from/in the sequence depicted in SEQ ID NO: 1 or the derived nucleic 
acid sequences, the intention being, however, that the enzymic activity or the biological activity 
of the synthesized proteins originating therefrom advantageously be retained for the insertion of 
one or more genes. Proteins which still have the essential enzymatic activity of threonine 
aldolase, i.e. their activity is negligibly reduced, means proteins having at least 10%, preferably 
10 20%, particularly preferably 30%, very particularly preferably 40%. nf the original biological or 
enzymic activity, advantageously compared with the protein encoded by SEQ ID NO: 2. 

Homologs of SEQ ID NO: 1 or of the derived sequences also mean, for example, bacterial, 
fungal and plant homoiogs, truncated sequences, single-stranded DNA or RNA of the coding 
and noncoding DNA sequence. 

15 Homologs of SEQ ID NO: 1 or of the derived sequences also mean derivatives such as, for 
example, promoter variants. The promoter upstream of the indicated nucleotide sequences 
may be modified by one or more nucleotide exchanges, by insertion(s) and/or deletion(s) 
without, however, impairing the functionality or activity of the promoters. It is additionally possible 
for the activity of the promoters to be increased by modifying their sequence, or for them to be 

20 completely replaced by more active promoters, even from heterologous organisms. 

The aforementioned nucleic acids and protein molecules having aldolase activity which are 
involved in the amino acid metabolism are used to increase the yield, production and/or 
efficiency of production of a desired compound or a decrease in unwanted compounds. 

The organisms used in the process of the invention are grown or cultured in a manner known to 
25 the skilled worker depending on the host organism. Microorganisms are ordinarily grown in a 
liquid medium which contains a carbon source, usually in the form of sugars, a nitrogen source, 
usually in the form of organic nitrogen sources such as yeast extract or salts such as ammonium 
sulfate, trace elements such as iron, manganese, magnesium salts and, where appropriate, 
vitamins, at temperatures between 0°C and 100°C. preferably between 10°C to 60°C, while 
30 passing in oxygen. The pH of the nutrient liquid can be kept at a fixed value during this, i.e. 
controlled during the cultivation, or not. The cultivation can be carried out batchwise, 
semibatchwise or continuously! Nutrients can be introduced at the start of the fermentation or be 
subsequently fed in semicontinuously or continuously. The produced amino acids can be 
Isolated from the organisms by processes known to the okilled worker. For example by 
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extraction, salt precipitation and/or ion exchange chromatography. The organisms may also for 
this purpose be disrupted beforehand. 

The process of the invention is, when the host organisms are microorganisms, advantageously 
carried out at a temperature between 0°C to 95°C, preferably between 10°C to 85°C, particularly 
5 preferably between 1 5°C to 75"C, very particularly preferably between 1 5°C to 45°C. 

The pH is advantageously kept at between pH 4 and 12, preferably between pH 6 and 9, 
particularly preferably between pH 7 and 8, during this. 

The process of the invention can be operated batchwise, semibatchwise or continuously. A 
summary of known cultivation methods is to be found in the textbook by Chmiel 
10 (BioprozeUtechnik 1 . Einfuhrung in die Bioverfahrenstechnik (Gustav Fischer Verlag, Stuttgart, 
1991 )) or in the textbook by Storhas (Bioreaktoren und periphere Einrichtungen (Vieweg Verlag, 
Braunschweig/Wiesbaden, 1994)). 

The culture medium to be used must meet the requirements of the respective strains in a 
suitable manner. Descriptions of culture media for various microorganisms are present in the 
1 5 handbook "Manual of Methods for General Bacteriology" of the American Society for 
Bacteriology (Washington D. C, USA, 1981). 

These media which can be employed according to the invention include, as described above, 
usually one or more carbon sources, nitrogen sourcus, inuiyanic salts, vitamins and/or trace 
elements, 

20 Preferred carbon sources are sugars such as mono-, di- or polysaccharides. Examples of very 
good carbon sources are glucose, fructose, mannose, galactose, ribose, sorbose, ribulose, 
lactose, maltose, sucrose, raffinose, starch or cellulose. Sugars can also be added to the media 
via complex compounds such as molasses, or other byproducts of sugar refining. It may also be 
advantageous to add mixtures of various carbon sources. Other possible carbon sources are 

25 oils and fats such as, for example, soybean oil, sunflower oil, peanut oil and/or coconut fat, fatty 
acids such as, for example, palmitic acid, stearic acid and/or linoleic acid, alcohols and/or 
polyalcohols such as, for example, glycerol, methanol and/oi elhanol and/or organic acids such 
as, for example, acetic acid and/or lactic acid. 

Nitrogen sources are usually organic or inorganic nitrogen compounds or materials which 
30 contain these compounds. Examples of nitrogen sources include ammonia in liquid or gaseous 
form or ammunium salts such as ammonium sulfate, ammonium chloride, ammonium 
phosphate, ammonium carbonate or ammonium nitrate, nitrates, urea, amino acids or complex 
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nitrogen sources such as corn steep liquor, soybean meal, soybean protein, yeast extract, meat 
extract and others. The nitrogen sources may be used singly or as a mixture. 

Inorganic salt compounds which may be present in the media include the chloride, phosphorus 
or sulfate salts of calcium, magnesium, sodium, cobalt, molybdenum, potassium, manganese, 
5 zinc, copper and iron. 

For preparing sulfur-containing fine chemicals, in particular methionine, it is possible to use as 
sulfur source inorganic sulfur-containing compounds such as, for example, sulfates, sulfites, 
dithionites, tetrathionates, thiosulfates, sulfides or else organic sulfur compounds such as 
mercaptans and thiols. 

10 It is possible to use as phosphorus source phosphoric acid, potassium dihydrogenphosphate or 
dipotassium hydrogenphosphate or the corresponding sodium-containing salts. 

Chelating agents can be added to the medium in order to keep the metal ions in solution. 
Particularly suitable chelating agents include dihydroxyphenols such as catechol or 
protocatechuate, or organic acids such as citric acid. 

1 5 The fermentation media employed according to the invention for cultivating microorganisms 
normally also contain other growth factors such as vitamins or growth promoters, which include, 
for example, biotin, riboflavin, thiamine, folic acid, nicotinic acid, panthothenate and pyridoxine. 
Growth factors and salts are often derived from complex media components such as yeast 
extract, molasses, corn steep liquor and Hie like. Suitable precursors can moreover be added to 

20 the culture medium. The exact composition of the media compounds depends greatly on the 
particular experiment and is chosen individually for each specific case. Information about media 
optimization is obtainable from the textbook "Applied Microbiol. Physiology, A Practical 
Approach" (editors P.M. Rhodes, P.F. Stanbury, IRL Press (1997) pp. 53-73, ISBN 0 19 963577 
3), Growth media can also be purchased from commercial suppliers such as Standard 1 (Merck) 

25 or BHI (Brain heart infusion, DIFCO) and the like. 

All media components are sterilized either by heat (1.5 bar and 121°C for 20 min) or by 
sterilizing filtration. The components can be sterilized either together or, If necessary, separately. 
Alt media components can be present at the start of the cultivation or optionally be added 
continuously or batchwise. 

30 The temperature of the culture is normally between 15°C and 45°C, preferably at 25°C to 40°C, 
and can be kept constant or changed during the experiment. The pH of the medium should be in 
the range Fium 5 to 8.5, preferably around 7. The pH for the cultivation can be controlled during 
the cultivation by adding basic compounds such as sodium hydroxide, potassium hydroxide, 
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ammonia or aqueous ammonia or acidic compounds such as phosphoric acid or sulfuric acid. 
Foaming can be controlled by employing antifoams such as, for example, fatty acid polyglycol 
esters. The stability of plasmlds can be maintained by adding to the medium suitable subotances 
having a selective effect, for example antibiotics. Aerobic conditions are maintained by 
5 introducing oxygen or oxygen-containing gas mixtures such as, for example, ambient air into the 
culture. The temperature of the culture is normally from 20°C to 45°C and preferably from 25°C 
to 40°C. The culture is continued until formation of the desired product is at a maximum. This 
aim is normally achieved within 10 hours to 160 hours. 

The fermentation broths obtained in this way, containing in particular L-methionine and/or L- 
1 0 lysine, normally have a dry matter content of from 7.5 to 25% by weight. 

Sugar-limited fermentation is additionally advantageous, at least at the end, but especially over 
at least 30% of the fermentation time. This means that the concentration of utilizable sugar in the 
fermentation medium is kept at, or reduced to, > 0 to 3 g/l during this time. 

The fermentation broth is then processed further. Depending on requirements, the biomass can 
15 be removed entirely or partly by separation methods, such as, for example, centrifugation, 
filtration, decantation or a combination of these methods, from the fermentation broth or left 
completely in it. 

The fermentation broth can then be thickened or concentrated by known methods, such as, for 
example, with the aid of a rotary evaporator, thin-film evaporator, falling film evaporator, by 
20 reverse osmosis or by nanofiltration. This concentrated fermentation broth can then be worked 
up by freeze drying, spray drying, spray granulation or by other processes. 

However, it is also possible to purify the amino acid further. For this purpose, the product- 
containing broth after removal of the biomass is subjected to a chromatography on a suitable 
resin, in which case the desired product or the impurities are retained wholly or partly on the 
25 chromatography resin. These chromatography steps can be repeated if necessary, using the 
same or different chromatography resins. The skilled worker is familiar with the choice of 
suitable chromatography resins and their most effective use. The purified product can be 
concentrated by filtration or ultrafiltration and stored at a temperature at which the stability of the 
product is a maximum. 

30 The identity and purify nf the isolated compound(s) can be determined by prior art techniques. 
These include high performance liquid chromatography (HPLC), spectroscopic methods, 
staining methods, thin-layer chromatography, NIRS, enzyme assay or microbiological assays. 
These analytical methods are summarized in: Patek et al. (1994) Appi. Environ. Microbiol. 
60:133-140; Malakhova et al. (1996) Biotekhnologiya 1 1 27-32; and Schmidt et al. (1998) 



Metanomics GmbH & Co. KGaA 



20020960 



PF 54195 



42 

Bioprocess Engineer. 19:67-70. Ulmann's Encyclopedia of Industrial Chemistry (1996) Vol. A27, 
VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, pp. 559-566, 575-581 and pp. 581-587; 
Michal, G (1999) Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, John 
Wiley and Sons; Fallon, A. et al. (1 987) Applications of HPLC in Biochemistry in: Laboratory 
5 Techniques in Biochemistry and Molecular Biology, Vol. 17. 

The amino acids obtained in the process are suitable as starting material for synthesizing further 
products of value. They can be used for example in combination with one another or alone for 
producing drugs, human foods, animal feeds or cosmetics. 

The transfer of foreign genes into the genome of a plant is referred to, as described above, as 

1 0 transformation. In this case, the methods described for transformation and regeneration of 

plants from plant tissues or plant cells are utilized for transient or stable transformation. Suitable 
methods are protoplast transformation by polyethylene glycol-induced DNA uptake, the biolistic 
method with the gene gun - the so-called particle bombardment method, electroporation, 
incubation of dry embryos in DNA-containing solution, microinjection and Agrobacterium- 

15 mediated gene transfer. Said processes are described, for example, in B. Jenes et al., 

Techniques for Gone Transfer, in: Transgenic Plants, Vol. 1, Engineering and Utilization, edited 
by S.D. Kung and R. Wu, Academic Press (1993) 128-143 and in Potrykus Annu. Rev. Plant 
Physiol. Plant Moiec. Biol. 42 (1991) 205-225). The construct to be expressed is preferably 
cloned into a vector which is suitable for transforming Agrobacterium tumefaciens, for example 

20 pBtn19 (Bevan et al., Nucl. Acids Res. 12 (1984) 8711). Agrobacteria transformed with such a 
vector can 'then be used in a known manner for transforming plants, especially crop plants, such 
as, for example, tobacco plants, by, for example, bathing wounded leaves or pieces of leaves in 
a solution of agrobacteria and then cultivating in suitable media. Transformation of plants with 
Agrobacterium tumefaciens is described for example by Hofgen and Willmitzer in Nucl. Acid 

25 Res. (1988) 16, 9877 or is disclosed inter alia in F.F. White, Vectors for Gene Transfer in 

Higher Plants; in Transgenic Plants, Vol. 1, bngineering and Utilization, edited by S.D. Kung and 
R. Wu, Academic Press, 1993, pp. 15-38. 

Marker genes are advantageously used for selection for successful introduction of the nucleic 
acids of the invention into a host organism. These marker genes make it possible to identify 

3U successful introduction Of the nucleic acids of the invention by a number of different principles, 
for example by visual recognition with the aid of fluorescence, luminescence or in the 
wavelength range of light which is visible to humans, via a herbicide or antibiotic resistance, via 
so-called nutritional (auxotrophic markers) or antinutritional markers, by enzyme assays or via 
phyto hormones. Examples of such makers which may be mentioned here are the GFP (= green 

35 fluorescent protein); the luciferin/luceferace system; p-galactosidase with its colored substrates 
e.g. X-Gal; herbicide resistances to, for example, imidazolinone, glyphosate, phosphothricin or 
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sulfonylurea; antibiotic resistances to, for example, bleomycin, hygromycin, streptomycin, 
kanamycin, tetracycline, chloramphenicol, ampicillin, gentamicin, geneticin (G418), 
spectinomycin or blasticidin to mention only a few, nutritional markers such as utilization of 
mannose or xylose or antinutritionai markers such as 2-deoxyglucose resistance. This list 
5 represents a small section of possible markers. Markers of these types are well known to the 
skilled worker. Different markers are preferred, depending on organism and selection method. 

It is known about stable or transient integration of nucleic acids in plant cells that, depending on 
the expression vector used and transfection technique used, only a small part of the cells takes 
up the foreign DNA and, if desired, integrates it in their genome. For identification and selection 

1 0 of these integrants, usually a gene which encodes a selectable marker (e.g. antibiotic 
resistance) is introduced together with the gene of interest into the host cells. Preferred 
selectable markers include in plants those which confer resistance to a herbicide such as 
giyphosphate or glufosinate. Further suitable markers are, for example, markers which encode 
genes which are involved in biosynthetic pathways of, for example, sugars or amino acids, such 

1 5 as fi-galactosidase, ura3 or ilv2. Markers encoding genes such as luciferase, gfp or other 

fluorescence genes are likewise suitable. These markers can be used in mutants in which these 
genes are not functional because, Tor example, they have been deleted by conventional 
methods. Markers encoding a nucleic acid encoding a selectable marker can moreover be 
introduced into a host cell on the same vector as that coding for the threonine aldolases used in 

20 the process, or can be introduced on a separate vector. Cells stably transfected with the 
introduced nucleic acid can be identified for example by selection (e.g. cells which have 
integrated the selectable marker survive, whereas the other cells die). 

Since, usually, the marker genes, specifically the antibiotic and herbicide resistance gene, are 
no longer required or are unwanted in the transgenic host cell after successful introduction of the 

25 nucleic acids, techniques making it possible to delete or excise these marker genes are 

advantageously used in the process of the Invention lui introducing the nucleic acids. One such 
method is so-called cotransformation. In cotransformation, two vectors are used simultaneously 
for the transformation, one vector harboring the nucleic acids of the invention and the second 
one harboring the marker gene(s). A large part of the transformants acquires or contains both 

30 vectors in the case of plants (up to 40% of the transformants and more). It is then possible to 

remove the marker genes from the transformed plant by crossing. A further method uses marker 
genes integrated into a transposon for the transformation together with the desired nucleic acids 
(so-called Ac/Ds technology). In some cases (about 10%). after successful transformation the 
transposon jumps out of the genome of the host cell and is lost, in a further number of cases, 

35 the transposon jumps into another site. In these cases, outcrossing of the marker gene again is 
necessary. Microbiological techniques enabling or facilitating detection of such events have 
been developed. A further advantageous method uses so-called recombination systems which 
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have the advantage that it is possible to dispense with outcrossing. The best-known system of 
this type is the so-called Cre/lox system. Cre1 is a recombinase which deletes the sequences 
located between the loxP sequence. If the marker gene is integrated between the loxP 
sequence, it is deleted by expression of the recombinase after successful transformation. 
5 Further recombinase systems are the HIN/HIX, the FLP/FRT and the REP/STB systems {Tribble 
et al., J.Biol. Chem., 275, 2000: 22255 - 22267; Velmurugan et al., J. Cell Biol., 149, 2000: 553 - 
566). Targeted integration of the nucleic acid sequences of the invention into the plant genome 
is also possible in principle but less preferred because of the large amount of work involved. 
These methods are, of course, also applicable to microorganisms such as yeasts, fungi or 
10 bacteria. 

Agrobacteria transformed with an expression vector of the invention can likewise be used in a 
known manner for transforming plants such as test plants such as Arabidopsis or crop plants 
such as, for example, cereals, corn, oats, rye, barley, wheat, soybean, rice, cotton, sugar beet, 
canola, sunflower, flax, hemp, potato, tobacco, tomato, carrot, paprika, oilseed rape, tapioca, 
15 cassava, arrowroot, tagetes, alfalfa, lettuce and the various tree, nut and grape species, 

especially oil-containing crop piants such as soybean, peanut, castor oil plant, sunflower, corn, 
cotton, flax, oilseed rape, coconut, oil palm, safflower (Carthamus tincturius) or cocoa bean, e.g. 
by bathing wounded leaves or pieces of leaves in a solution of agrobacteria and then cultivating 
in suitable media. 

20 The genetically modified plant cells can be regenerated by all methods known to the skilled 
worker. Appropriate methods can be found in the obovcmentioned publications by S.D. Kung 
and R. Wu, Potrykus or Hofgen and Willmitzsr. 

Besides the transformation of somatic cells, which must then be regenerated to plants, it Is also 
possible to transform cells of plant meristems and, in particular, those cells which develop into 

25 gametes. In this case, the transformed gametes lead to transgenic plants by the route of natural 
plant development. Thus, for example, seeds of Arabidopsis are treated with agrobacteria, and 
seeds are obtained from the plants developing therefrom, which seeds show a certain 
transformation rate and are therefore transgenic ( Feldman, KA and Marks MD (1987), 
Agroibacferium-mediated transformation of germinating seeds of Arabidopsis thaliana: a non 

30 tissue culture approach. Mol Geii Genet 208:274-209; Teldmann K (1 092) T DNA insertion 
mutagenesis in Arabidopsis: seed infection transformation. In C Koncz, N-H Chua and J Shell, 
eds. Methods in Arabidopsis Research. Word Scientific, Singapore, pp. 274-289). Alternative 
methods are based on repeated removal of the inflorescences and incubation of the severed 
site in the center of the rossette with transformed agrobacteria, likewise making it possible to 

35 obtain transformed seeds iater (Chang, SS, Park SK, Kim, BC, Kang, BJ, KimDU and Nam, HG 
(1994) Stable genetic transformation of Arabidopsis thaliana by Agrobacterium inoculation in 
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plants. Plant J. 5: 551-558; Katavic, V, Haughn, GW, Reed, D, Martin, M and Kunst, L (1994) In 
plants transformation of Arabidopsis thaiiana.Uo\ Gen Genet, 245: 363-370). However, the 
method of vacuum infiltration with its modifications such as floral dip is particularly efficient. In 
the vacuum infiltration of Arabidopsis, whole plants are treated with a suspension of 
5 agrobactsrium in vacuo (Bechthold, N, Ellis, J. and Pelletier, G (1993) In planta Agrobacterium- 
mediated gene transfer by infiltration of adult Arabidopsis thalina plants. C R Acad Sci Paris Life 
Sci, 316: 1 194-1 199), while in the floral dip method the developing flower tissue is briefly 
incubated in a suspension of agrobacteria mixed with a surfactant (Clough, SJ and Bent, AF 
(1998) Floral dip: a simple method for Agratecfen'um-mediated transformation of Arabidopsis 
10 tlmliana. The Plant J. 16, 735-743). in both cases, a certain percentage of transgenic seeds are 
harvested and can be distinguished from non-transgenic seeds by cultivation under the selective 
conditions described above. 

A further aspect of the invention therefore relates to transgenic organisms transformed with at 
least one nucleic acid sequence or expression cassette of the invention or with a vector of the 

15 invention, and to cells, cell cultures, tissues, parts - such as, for example in the case of plant 

organisms, leaves, roots, etc. - or propagation material derived from such organisms. The terms 
"host organism", "host cell", "recombinant (host) organism", "recombinant (host) cell", 
"transgenic (host) organism" and "transgenic (host) cell" are used interchangeably herein. It is 
self-evident that these terms relate not only to the particular host organism or to the particular 

20 target cell but also to the progeny or potential progeny of these organisms or cells. Since certain 
modifications may occur in subsequent generations owing to mutation or environmental effects, 
these progeny are not necessarily identical to the parental cell but are still included within the 
scope of the term as used herein. 

The amino acid sequences classified in SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
25 NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10 are a further aspect of 
the invention. 

This invention is illustrated further by the following examples, which are not to be regarded as 
restrictive. The contents of all the references, patent applications, patents and published patent 
applications cited in this patent application are incorporated herein by reference. 

30 Examples: 

Example 1: Cloning of SEQ ID NO: 1 into Escherichia coll 

SEQ ID NO: 1 was cloned by well-known and well-established methods (see, for example, 
Sambrook, J. et al. (1989) "Molecular Cloning: A Laboratory Manual". Cold Spring Harbor 
Laboratory Press or Ausubel, F.M. et al. (1994) "Current Protocols in Molecular Biology", John 
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Wiiey & Sons) into the piasmids pBR322 (Sutcliffe, J.G. (1979) Proc. Natl Acad. Sci. USA, 75: 
3737-3741); pACYC177 (Change & Cohen (1978) J. Bacterid. 134: 1141-1156); piasmids of the 
pBS series (pBSSK+, pBSSK- and others; Stratagene, LaJoHa, USA) or cosmids such as 
SuperCosI (Stratagene, LaJoila, USA) or Lorist6 (Gibson, T.J. Rosenthal, A., and Waterson, 
5 R.H. (1987) Gene 53: 283-286) for expression in E. coli. 

Example 2: DNA sequencing and computer function analysis 

The DNA sequencing was carried out by standard methods, in particular the chain termination 
method with ABI377 sequencers (see, for example, Fleischman, R.D. et al. (1995) "Whole- 
genome Random Sequencing and Assembly of Haemophilus influenzae Rd. ", Science 269; 
10 496-512). 

Example 3: In vivo mutagenesis 

Mutagenesis of Corynebacterium glutamicum in vivo can be carried out by passing a plasmid (or 
other vector) DNA through E. coli or other microorganisms (e.g. Bacillus spp. or yeasts such as 
Saccharomyces cerevisiae) unable to maintain the integrity of their genetic information. Usual 
15 mutator strains have mutations in the genes for the DNA repair system [e.g. mutHLS, mutD, 
mutT, etc., for comparison, see Rupp, W.D. (1996) DNA repair mechanisms in Escherichia coli 
and Salmonella, pp. 2277-2294, ASM: Washington]. These strains are known in the skilled 
worker. The use of these strains is explained for example in Greener, A. and Callahan, M. 
(1994) Strategies 7; 32-34. 

20 Example 4: DNA transfer between Escherichia coli and Corynebacterium glutamicum 

Several Corynebacterium and Brevibacterium species contain endogenous piasmids (such as, 
for example, pHM1519 or pBL1) which undergo autonomous replication (for a review, see, for 
example, Martin, J.F, et al. (1987) Biotechnology G: 137-146). Shuttle vectors for Eochorichia coli 
and Corynebacterium glutamicum can easily be constructed by means of standard vectors for 

25 E. coli (Sambrook, J. et al., (1989), "Molecular Cloning: A Laboratory Manual", Cold Spring 

Harbor Laboratory Press or Ausubel, F.M. et al. (1994) "Current Protocols in Molecular Biology", 
John Wiley & Sons), to which an origin of replication for and a suitable marker from 
Corynebacterium glutamicum is added. Such origins of replication are preferably taken from 
endogenous piasmids isolated from Corynebacterium and Brevibacterium species. Particularly 

30 ueed as transformation markers for these species are genes for kanamycin resistance (such as 
those derived from the Tn5 or Tn-903 transposoh) or for chloramphenicol (Winnacker, E.L. 
(1987) "From Genes to Clones - Introduction to Gene Technology, VCH, Weinheim). There are 
numerous examples in the literature of the preparation of a large number of shuttle vectors 
which are replicated in E. coli and C. glutamicum, and which can be used for various purposes, 
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including gene overexpression (see, for example, Yoshihama, M. et a!. (1985) J. Bacteriol. 162: 
591-597, Martin, J.F. eial., (1987) Biotechnology, 5: 137-146 and Eikmanns, B.J. et al. (1992) 
Gene 102: 93-98). Suitable vectors which replicate in coryneform bacteria are, for example, pZ1 
(Menkel et al., Appl. Environ. Microbiol., 64, 1989: 549 - 554), pEkExl (Eikmanns et al., Gene 
5 102, 1991 : 93 - 98) or pH32-1 (Sonnen et al, Gene 107, 1001 : 60 74). These vectors are 
based on the crytic plasmids pHM1519, pBL1 or pGA1 . Other plasmid vectors such as, for 
example, those based on pCG4 (US 4,489,160), pNG2 (Serwold-Davis et al., FEMS Microbiol. 
Lett., 66, 1990: 1 19 - 124) or pAG1 (US 5,158,891) can be used in a similar way. 

It is possible by standard methods to clone a gene of interest into one of the shuttle vectors 
10 described above, and to introduce such hybrid vectors into Corynebacterium glutamicum strains. 
Transformation of C. glutamicum can be achieved by protoplast transformation (Kastsumata, R. 
et al., (1984) J. Bacteriol. 159, 306-311), electroporation (Liebl, E. et., (1989) FEMS Microbiol. 
Letters, 53: 399-303) and, in cases where specific vectors are used, also by conjugation (as 
described, for example, in Schafer, A., et (1000) J. Bacteriol. 1 72: 1663-1666). It is likewise 
1 5 possible to transfer the shuttle vectors for C. glutamicum to E. coli by preparing plasmid DNA 
from C. glutamicum (by standard methods known in the art) and transforming it into E. coli. This 
transformation step can take place using standard methods, but an Mcr-deficient E. coli strain is 
advantageously used, such as NM522 (Gough & Murray (1983) J. Mol. Biol. 166: 1-19). 

If it is intended, advantageously, that the transformed sequence(s) be integrated into the 
20 genome of the coryneform bacteria, standard techniques for this are also known to the skilled 
worker. For example, plasmid vectors like those described by Remscheid et al. (Appl. Environ. 
Microbiol., 60, 1994: 126 - 132) for the duplication or amplification of the hom-thrB operon are 
used for this purpose. In this method, the complete gene is cloned into a plasmid vector able to 
replicate in a host such as E. coli but not in C. glutamicum. Examples of suitable vectors are 
25 pSUP301 (Simon et al., Bio/Technology 1, 1983: 784 - 791), pKIBmob or pK19mob (Schafer et 
al., Gene 145, 1994: 69 - 73), pGEM-T (Promega Corp., Madison, Wl, USA), pCR2.1-TOPO 
(Schuman, J. Biol. Chem., 269, 1994: 32678 - 32684, US 5,487,993), pCR®Blunt (from 
Invitrogen, Groningen, The Netherlands) or pEM1 (Schrumpf et al., J. Bacteriol., 173, 1991 : 
4510-4516). 

30 Example 5: Determination of the expression of the mutant/transgenic protein 

Observations of the activity of a mutated or transgenic protein in a transformed host cell are 
based on the fact that the protein is expressed in a similar way and in similar quantity to the wild- 
type protein. A suitable method for determining the transcription rate of the mutant or transgenic 
gene (an indicator of the quantity of mRNA available for translation of the gene product) is to 
35 carry out a Northern blot (see, for example, Ausubel et al., (1988) Current Protocols in Molecular 
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Biology, Wiley: New York), where a primer which is designed so that it binds to the gene of 
interest is provided with a detectable (usually radioactive or chemiluminescent) label so that - 
when the complete RNA is extracted from a culture of the organism, fractionated on a gel, 
transferred to a stable matrix and incubated with this probe - the binding and the quantity of the 
5 binding of the probe indicates the presence and also the quantity of mRNA for this gene. This 
information is a demonstration of the extent of transcription of the gene. Complete cellular RNA 
can be isolated from Corynebacterium glutamicum by various methods known in the art, as 
described in Bormann, E.R. et al., (1992) Mol. Microbiol. 6: 317-326. 

The presence or the relative quantity of protein' translated from this mRNA can be determined by 
10 employing standard techniques such as Western blotting (see, for example, Ausubel et al. 
(1988) "Current Protocols in Molecular Biology", Wiley, New York). In this method, all cellular 
proteins are extracted, separated by gel electrophoresis, transferred to a matrix such as 
nitrocellulose, and incubated with a probe, such as an antibody, which binds specifically to the 
desired protein This probe is usually provided directly or indirectly with a chemiluminescent or 
1 5 colorimetric label which can easily be detected. The presence and the observed quantity of 
labels indicates the presence and the quantity of the mutant protein which is sought in the cell. 

Example 6; Growth of genetically modified Corynebacterium glutamicum - media and 
cultivation conditions 

Genetically modified corynebacteria are cultured in synthetic or natural growth media. A number 
of different growth media for corynebacteria are known and easily obtainable (Lieb et al. (1989) 
Appl. Microbiol. Biotechnol. 32: 205-210; von der Osten et al. (1998) Biotechnology Letters 1 1 : 
1 1-16; Patent DE 4 120 867; Liebl (1992) "The Genus Corynebacterium", in: The Procaryotes, 
Vol. II, Balows, A., et al., editors, Springer-Verlag). These media consist of one or more carbon 
sources, nitrogen sources, inorganic salts, vitamins and trace elements. Preferred carbon 
soufues are sugars such as mono-, di- or polysaccharides. Examples of very good carbon 
sources are glucose, fructose, mannose, galactose, ribose, sorbose, ribulose, lactose, maltose, 
sucrose, raffinose, starch or cellulose. Sugars can also be added to the media via complex 
compounds such as molasses, or other byproducts of sugar refining. It may also be 
advantageous to add mixtures of various carbon sources. Other possible carbon sources are 
alcohols and/or organic acids such as methanol, ethanol, acetic acid ur lactic acid. Nitrogen 
sources are usually organic or inorganic nitrogen compounds or materials which contain these 
compounds. Examples of nitrogen sources include ammonia gas, aqueous ammonia solutions 
or ammonium saits such as NH 4 CI or (NH 4 ) 2 S04, NH 4 OH, nitrates, urea, amino acids or 
complex nitrogen sources such as corn steep liquor, soybean meal, soybean protein, yeast 
extracts, meat extracts and others. Mixtures of the aforementioned nitrogen sources may also 
advantageously be used. 



25 



30 
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Inorganic salt compounds which may be present in the media include the chloride, phosphorus 
or sulfate salts of calcium, magnesium, sodium, cobalt, molybdenum, potassium, manganese, 
zinc, copper and iron. Chelating agents can be added to the medium in order to keep the meta! 
ions in solution. Particularly suitable chelating agents include dihydroxyphenols such as catechol 
5 or protocalechuale, or organic acids such as citric acid. The media normally also contain other 
growth factors such as vitamins or growth promoters, which include, for example, biotin, 
riboflavin, thiamine, folic acid, nicotinic acid, pantothenate and pyridoxine. Growth factors and 
salts are often derived from complex media components such as yeast extract, molasses, corn 
steep liquor and the like. The exact composition of the media compounds depends greatly on 
10 the particular experiment and is chosen Individually for each specific case. Information about 
media optimization is obtainable, for example, from the textbook "Applied Microbiol. Physiology, 
A Practical Approach" (editors P.M. Rhodes, P.P. Stanbury, IRL Press (1997) pp. 53-73, ISBN 0 
19 963577 3). Growth media can also be purchased from commercial suppliers such as 
Standard 1 (Merck) or BHI (Brain heart infusion, DIFCO) and the like. 

15 All media components are sterilized either by heat (1.5 bar and 121 °C for 20 min) or by 

sterilizing filtration. The components can be sterilized either together or, if necessary, separately. 
All media components can be present at the start of the cultivation or optionally be added 
continuously or batchwise. 

The cultivation conditions are defined separately for each experiment. The temperature should 
20 be between 15°C and 45°C and can be kept constant or changed during the experiment. The pH 
of the medium should be in the range from 5 to 8.5, preferably around 7.0, and can be 
maintained by adding buffers to the media. One example of a buffer for this purpose is a 
potassium phosphate buffer. Synthetic buffers such as MOPS. HEPES; ACES etc. can be used 
alternatively or simultaneously. The cultivation pH can be kept constant during the cultivation 
25 also by adding, for example, NaOH or NH4OH. If complex media components such as yeast 
extract are used, the requirement for additional buffers is reduced because many complex 
compounds have a high buffer capacity. If a fermenter is used for cultivating microorganisms, 
the pH can also be controlled with gaseous ammonia. 

The incubation time is usually in a range from several hours up to several days. This time is 
30 selected so that the maximum quantity of product accumulates in the fermentation broth. The 
disclosed growth experiments can be carried out in a large number of containers such as 
microtiter plates, glass tubes, glass flasks or glass or metal fermenters of various si7fis For 
screening a large number of clones, the microorganisms should be cultured in microtiter plates, 
glass tubes or shaker flasks either with or without baffles. 100 ml shaker flasks are preferably 
35 used and are charged with 10% (based on volume) of the required growth medium. The flasks 
should be shaken on an orbital shaker (amplitude 25 mm) with a speed in the range from 100- 
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300 rpm. Evaporation losses can be reduced by maintaining a moist atmosphere; alternatively, a 
mathematical correction should be carried out for the evaporation losses. 

If genetically modified clones are investigated, there should also be testing of an unmodified 
control clone or a control clone which contains the basic plasmid without insert. If a transgenic 
5 sequence is to be expressed, in this case too a control clone should also advantageously be 
tested. The medium is advantageously inoculated to an OD600 of 0.5-1.5, using cells cultured 
on agar plates, such as CM plates (10 g/l glucose, 2.5 g/l NaCI, 2 g/l urea, 10 g/i polypeptone, 
5 g/l yeast extract, 5 g/l meat extract, 22 g/l agar, pH 6.8 with 2 M IMaOH) which have been 
incubated at 30°C. The media are inoculated either by introducing a saline solution of 
10 C. glutamicum cells from CM plates or by adding a liquid preculture of this bacterium. 

Example 7: In vitro analysis of the function of the proteins encoded by the transformed 
sequences 

Determination or the activities and kinetic parameters of enzymes is well known in the art. 
Experiments for determining the activity of a particular modified enzyme must be adapted to the 

1 5 specific activity of the wild-type enzyme, which is within the capabilities of the skilled worker. 
Reviews of enzymes in general and specific details relating to the structure, kinetics, principles, 
methods, applications and examples of the determination of many enzymic activities can be 
found for example in the following references: Dixon, M., and Webb, E.C: (1979) Enzymes, 
Longmans, London; Fersht (1985) Enzyme Structure and Mechanism, Freeman, New York; 

20 Walsh (1979) FnTymatic Reaction Mechanisms. Freeman, San Francisco; Price, N.C., Stevens, 
L. (1982) Fundamentals of Enzymology. Oxford Univ. Press: Oxford; Boyer, P.D: editor (1983) 
The Enzymes, 3rd edition, Academic Press, New York; Bisswanger, H. (1994) Enzymkinetik, 
2nd edition, VCH, Weinheim (ISBN 3527300325); Bergmeyer, H.U., Bergmeyer, J., GralJl. M. 
editors (1983-1986) Methods of Enzymatic Analysis, 3rd edition, Vol. I-XM, Verlag Chemie: 

2fj Weinheim; and Ullmann's Encyclopedia of Industrial Chemistry (1987) Vol A9, "Enzymes", 
VCH, Weinheim, pp. 352-363. 

Example 8: Analysis of the influence of the nucleic acids on the production of the amino acids 

The effect of the genetic modification in C. glutamicum on the production of an amino acid can 
be determined by culturing the modified microorganisms under suitable conditions (such as 

30 those described above) and investigating the medium and/or the cellular components for the 

increased production of the amino acid. Such analytical techniques are well known to the skilled 
worker and include spectroscopy, thin-layer chromatography, staining methods of various types, 
enzymatic and microbiological methods, and analytical chromatography such as high 
performance liquid chromatography (see, for example, Ullman, Encyclopedia of Industrial 

35 Chemistry, Vol. A2, pp. 89-90 and pp. 443-613, VCH: Weinheim (1985); Fallon, A., et al., (1987) 
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"Applications of HPLC in Biochemistry" in: Laboratory Techniques in Biochemistry and Molecular 
Biology, Vol. 17; Rehm et al. (1993) Biotechnology, Vol. 3, Chapter IN: "Product recovery and 
purification", pp. 469-714, VCH: Weinheim; Belter, P.A. eta!. (1988) Biuseparations: 
downstream processing for Biotechnology, John Wiley and Sons; Kennedy, J.F. and Cabral, 
5 J.M.S. (1992) Recovery processes for biological Materials, John Wiley and Sons; Shaeiwitz, J.A. 
and Henry, J.D. (1988) Biochemical Separations, in Ullmann's Encyclopedia of industrial 
Chemistry, Vol. B3; Chapter 11, pp. 1-27, VCH: Weinheim; and Dechow, F.J. (1989) Separation 
and purification techniques in biotechnology, Noyes Publications). 

In addition to measurement of the final product of the fermentation, it is likewise possible to 
1 0 analyze other components of the metabolic pathways used to produce the desired compound, 
such as intermediates and byproducts, in order to determine the overall productivity of the 
organism, the yield and/or the efficiency of production of the compound. The analytical methods 
include measurements of the quantities of nutrients in the medium (e.g. sugars, hydrocarbons, 
nitrogen sources, phosphate and other ions), measurements of the biomass composition and of 
1 5 growth, analysis of the production of usual metabolites from biosynthetic pathways and 
measurements of gases generated during the fermentation. Standard methods for these 
measurements are described in Applied Microbial Physiology; A Practical Approach, P.M, 
Rhodes and P.F. Stanbury, editors, IRL Press, pp. 103-129; 131-163 and 165-192 (ISBN: 
0199635773) and the references indicated therein. 

20 Example 9: Purification of the amino acid from C. glutamicum culture 

The amino acid can be obtained from C. glutamicum cells and/or from the supernatant of the 
culture described above by various methods known in the art. For this purpose, firstly the culture 
supernatant is obtained, for which purpose the cells are harvested from the culture by slow 
centrifugation, and the cells can subsequently be fragmented or lysed by standard techniques 

25 such as mechanical force or oonication. The cell detritus is removed hy centrifugation. and the 
supernatant fraction is taken together with the culture supernatant for further purification of the 
amino acid. However, it is also possible to work up the supernatant alone if the concentration of 
the amino acid contained in the supernatant is sufficient. The amino acid or the amino acid 
mixture can then be further purified by, for example, an extraction and/or salt precipitation or by 

30 an ion exchange chromatography. 

If necessary and desired, further chromatography steps with a suitable resin may follow, with the 
amino acid either being retained on the chromatogrpahy resin, but many impurities in the sample 
not, or with the impurities remaining on the resin, but the sample with the product (amino acid) 
not. These chromatography steps may be repeated if necessary, using the same or different 
35 chromatography resins. The skilled worker is familiar with the selection of suitable 
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chromatography resins and the most effective use for a particular molecule to be purified. The 
purified product can be concentrated by filtration or ultrafiltration and stored at a temperature at 
which the stability of the product is a maximum. 

Many purification methods are known in the art and are not confined to the foregoing purification 
5 method. These are described for example in Bailey, J.E. & Ollis, D.F. Biochemical Engineering 
Fundamentals, McGraw-Hill: New York (1986). 

The identity and purity of the isolated amino acid can be determined by standard techniques of 
the art. These include high performance liquid chromatography (HPLC), spectroscopic methods, 
staining methods, thin-layer chromatography, NIRS, enzyme assay or microbiological assays. 

10 These analytical methods are summarized in: Patek et al. (1994) Appl. Environ. Microbiol. 60: 
133 140; Malakhova et al. (1996) Biotekhnologiya 11 : 27-32; and Schmidt et al. (1998) 
Bioprocess Engineer. 19: 67-70. Ulmann's Encyclopedia of Industrial Chemistry (1996) Vol. A27, 
VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, pp. 559-566, 575-581 and pp.- 581 -587; 
Michal, G (1999) Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, John 

1 5 Wiley and Sons; Fallon, A. et al. (1987) Applications of HPLC in Biochemistry in: Laboratory 
Techniques in Biochemistry and Molecular Biology, Vol. 17. 

Example 10: Cloning of SEQ ID: No 1 for expression in plants 

Unless indicated otherwise, standard methods from Sambrook et al., Molecular Cloning: A 
laboratory manual, Cold Spring Harbor 1989, Cold Spring Harbor Laboratory Press, are used. 

20 The PCR amplification of SEQ ID: No 1 took place in accordance with the protocol for Pfu Turbo 
DNA polymerase (from Stratagene). The composition was as follows: 1x PCR buffer [20 mM 
Tris-HCI ( P H 8.8), 2 mM MgS0 4 , 10 mM KCI, 10mM (NH 4 )S0 4 , 0.1% Triton X-100, 0.1 mg/ml 
BSA], 0.2 mM d-Thio-dNTP and dNTP (1:125), 100 ng of genomic DNA from Saccharomyces 
cerevisiae (strain S288C; from Research Genetics, Inc., now Invitrogen), 50 pmol of forward 

25 primer, 50 pmol of reverse primer, 2.5 u of Pfu Turbo DNA polymerase. The amplification cycles 
were as follows: 

1 cycle at 95°C for 3' followed by 36 cycles each of 1' 95 D C, 45" 50°C, and 210" 72°C, followed 
by 1 cycle at 72 °C for 8', then 4°C . 

The following primer sequences were chosen for the gene of SEQ ID: No 1 : 

30 i) forward primer (SEQ ID NO:1) 

5' GGAATTCCAGCTGACCACCATGACTGAATTCGAATTGCCTCCAA 
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ii) reverse primer (SEQ ID NO:1) 

5-GATCCCCGGGAATTGOCATGTCAGTATTTGTAGGTTTTTATTTCGC 

The amplicon was subsequently purified on QIAquick coiumns in accordance with a standard 
protocol (from Qiagen). 

5 Restriction of the vector DNA (30 ng) was cut with EcoR\ and Smal by the standard protocol, 
and the EcoRI cleavage site was filled in by the standard protocol (MBI-Fermentas) and stopped 
by adding high-salt buffer. The cut vector fragments were purified on Nucleobond columns by 
the standard protocol (Machery-Nagel). A binary vector containing a selection cassette 
(promoter, selection marker, terminator) and an expression cassette with promoter, cloning 

10 cassette and terminator sequence between the T-DNA border sequences was used. The binary 
vector has no EcoRI and Smal cleavage sites except in the cloning cassette. Employable binary 
vectors which can be used are known to the skilled worker, and a review of binary vectors and 
their use is given by Hellens, R., Mullineaux, P. and Klee H., (2000) A guide to Agrobacterium 
binary vectors, Trends in Plant Science, Vol. 5 No. 10, 446-^51. The cloning is also 

15 advantageously possible with other restriction enzymes, depending on the vector used. 
Appropriate advantageous cleavage sites can be attached to the ORF by using appropriate 
primers for the PCR amplification. 

About 30 ng of prepared vector and a defined quantity of prepared amplicon were mixed and 
ligated by adding ligase. 

20 Transformation of the ligated vectors took place in the same reaction vessel by adding 

competent £ coli cells (strain DH5alpha) and incubating at 1°C for 20', followed by a heat shock 
at 42°C for 90" and cooling to 4°C. This was followed by addition of complete medium (SOC) 
and incubation at 37°C for 45'. The entire mixture was then plated out on an agar plate with 
antibiotics (selected according to the binary vector used) and incubated at 3T°C overnight. 

25 Successful cloning was checked by amplification using primers which bind upstream and 

downstream of the restriction cleavage site and thus make amplification of the insert possible. 
The amplification took place in accordance with the Taq DNA polymerase protocol (Gibco-BRL). 
The composition was as follows: ix PCR buffer [20 mM Tris-HCL (pH 6.4), 1 .5 mM MgCI 2 , 
50 mM KCI, 0.2 mM dNTP, 5 pmol forward primer, 5 pmol reverse primer, 0.625 u Taq DNA 

30 polymerase 

The amplification cycles were as follows: 1 cycle at 94°C for 5', followed by 35 cycles each of 
15" 94°C, 15" 66"C and 5' 72°C, followed by 1 cycle at 72°C for 10', then A°C . 
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Several colonies were checked, and only a colony for which a PCR product of the expected size 
was detected was used further. 

An aliquot of this positive colony was transferred into a reaction vessel filled with complete 
medium (LB) and incubated at 37°C overnight. For selection of the clone, the LB medium 
5 contained an antibiotic which was selected according to the binary vector used and the 
resistance gene present therein. 

The plasmid preparation took place as stated in the Qiaprep standard protocol (Qiagen). 

Example 1 1 : Production of transgenic plants expressing SEQ ID: NO 1 

1 ng of the isolated plasmid DNA was transformed by electroporation into competent cells of 
10 Agrobacterium tumefaciens, for example the strain GV 3101 pMP90 (Koncz and Schell, Mol. 
Gen. Gent. 204, 383-396, 1986). The selection of the agrobacterium strain depends on the 
choice of the binary vector. A review of possible, strains and their properties is to be found in 
Hellens, R., Mullineaux, P. and Klee H., (2000) A guide to Agrobacterium binary vectors, Trends 
in Plant Science, Vol. 5 No. 10, 446-451 . This was followed by addition of complete medium 
1 5 (YEP) and transfer into a new reaction vessel for 3 h at 28°C. The complete mixture was then 
plated out on YEP agar plates with the respective antibiotics, e.g. rifampicin and gentamycin for 
GV3101 pMP90, and a further antibiotic for selecting for the binary vector, and incubated at 
28°C for 48 h. 

The agrobacteria with the plasmid construct generated in bxample lu were then used for plant 
20 transformation. 

A colony was picked off the agar plate using a pipette tip and taken up in 3 ml of liquid TB 
medium which also contained appropriate antibiotics depending on the agrobacterium strain and 
binary plasmid. The preculture grew al 28°C and 120 rpm for 40 h. 

400 ml of LB medium which contained the same antibiotics as previously were used for the main 
25 culture. The preculture was transferred into the main culture. The latter grew at 28°C and 
120 rpm for 18 h. After centrifugation at 4000 rpm, the pellet was resuspended in infiltration 
medium (MS medium, 10% sucrose). 

To cultivate the plants for the transformation, dishes (Piki Saat 80, green with perforated bottom, 
30 x 20 x 4.5 cm, from Wiesauplast, Kunststofftechnik, Germany) were half-filled with a GS 90 
30 substrate (standard soil, Werkverband E.V., Germany). The dishes were watered overnight with 
0.05% Previcur solution ( Praviciir N, Aventis CrapScience). Arabidopsis thaliana C24 seeds 
(Nottingham Arabidopsis Stock Centre, UK ; NASC Stock N906) were scattered on the dish, 
about 1000 seeds per dish. The dishes were covered with a hood for the stratification (8 h, 
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110// //mol/m 2 /s" 1 , 22°C; 16 h, dark, 6°C). After 5 days, the dishes were placed in the short-day 
phytotron ( 8 h, 130 ^mol/mV 1 , 22°C; 16 h, dark, 20°C). They remained here for about 10 days 
until the first true leaves were formed. 

The seedlings were transferred into pots containing the same substrate {Teku pots, 10 cm, LC 
5 series, manufactured by Poppelmann GmbH&Co, Germany). Nine plants were pricked out into 
one pot. The pots were then again placed in the short-day phytotron for further growth. 

After 10 days, the plants were then put in the greenhouse cubicle (additional illumination, 16 h, 
340 p.E, 22°C; 8 h, dark, 20"C). They grew here for a further 17 days. 

Six-week-old, just flowering Arabidopsis plants were transformed by dipping in the suspension of 
1 0 agrobacteria described above for 1 0 sec. The latter had previously been mixed with 1 0 uJ of 
Silwett L77 (Crompton S.A., Osi Specialties, Switzerland). The corresponding method is 
described in Clough and Bent, 1998 (Clough, JC and Bent, AF. 1998 Floral dip: a simplified 
method for Agrofcactem/m-mediated transformation of Arabidopsis thaliana. Plant J. 16:735-743 

The plants were then laid out in a humidity chamber for 18 h. The pots were subsequently 
1 5 returned to the greenhouse for further growth. The plants remained there for 1 0 weeks until 
harvesting of the seeds was possible. 

Depending on the resistance marker used for selecting the transformed plants, the harvested 
seeds were sown in a greenhouse and subjected to spray selection or else, after sterilization, 
cultivated on agar plates with the appropriate selecting agent. After about 10-14 days, the 
20 transformed resistant plants differed distinctly from the dead wild-type seedlings and could be 
pricked out into 6 cm pots. The seeds of the transgenic A thaliana plants were stored in a 
freezer (at -20°C). 



Example 12: Cultivation of plants for bioanalytical investigations 

25 For bioanalytical investigation of the transgenic plants they were grown uniformly in a special 
cultivation. For the soil mixture, the GG-90 substrate was put in a potting machine (Laibie 
System GmbH, Singen, Germany) and used to fill pots. 35 pots were then placed together in one 
dish and treated with Previcur. 25 ml of Previcur were taken up in 10 I of tapwater for the 
treatment. This quantity was sufficient to treat about 200 pots. The pots were placed in the 

30 Previcur solution and additionally watered from above with tapwater without Previcur. The seeds 
were sown on the same day. 
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For sowing, the seeds which had been stored in the refrigerator (at -20°C) were removed from 
the Eppendorf tubes using a toothpick and transferred into the pots containing the soil. In total, 
about 5-12 seeds were distributed in the middle of the pot. 

After sowing, the dishes with the pots were covered with a matching plastic hood and placed in a 
5 stratification chamber in the dark at 4°C for 4 days. The humidity was about 80-90%. After the 
stratification, the test plants were cultivated with a 16 h of light and 8 h of dark rhythm at 20°C, a 
humidity of 60% and a CO z concentration of 400 ppm for 22-23 days. The light source 
comprised Osram Powerstar HQI-T 250 W/D Daylight lamps which produce light of a color 
spectrum similar to that of the sun with a light intensity of 220 u£/m 2 /s" 1 . 

1 0 The plants were subjected at an age of 8, 9 and 1 0 days to selection for the resistance marker. 
After a further 3-4 days, it was then possible clearly to differentiate the transgenic, resistant 
seedlings (small plants in the four-leaf stage) from the untransformed plants. The non- 
transgenic seedlings were bleached or dead. The transgenic resistant plants were singled out at 
the age of 14 days. The plants which showed the best growth in the middle of the pot were 

1 5 regarded as target plant. All the other plants were carefully removed with metai tweezers and 
discarded. 

During growth, the plants were watered with distilled water from above (onto the soil) and from 
below into the channels. The grown plants were then harvested at art aye of 23 days. 

Example 13: Metabolic analysis of transformed plants 

20 The changes, identified according to the invention, in the contents of described metabolites were 
identified by the following method. 

a) Sampling and storage of samples 

Sampling took place directly in the phytotron chamber. The plants were cut with small laboratory 
scissors, rapidly weighed on a laboratory balance, transferred into a precooled extraction thimble 
25 and placed in an aluminum rack cooled by liquid nitrogen. If necessary, the extraction thimbles 
can be stored in a freezer at -80°C. The time from cutting of the plant to freezing in liquid 
nitrogen wao not more than 10-20 s. 

b) Freeze drying 



Care was taken that, during the experiment, the plants either remained in the deep-frozen state 
30 (temperatures < -40°C) or had water removed by freeze drying before the first contact with 
solvents. 
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The aluminum rack with the plant samples in the extraction thimbles was placed in the 
precooled {-40°C) freeze dryer. The initial temperature during the main drying was -~35°C, and 
the pressure was 0.120 mbar. During the drying, the parameters weie changed in accordance 
with a pressure and temperature program. The final temperature after 12 hours was +30°C, and 
5 the final pressure was 0.001 to 0.004 mbar. After the vacuum pump and refrigeration had been 
switched off, the system was ventilated with air (dried by a drying tube) or argon. 

c) Extraction 

The extraction thimbles with the freeze-dried plant materia! were transferred immediately after 
the ventilation of the freeze dryer into the 5 ml ASE extraction cartridges. 

10 The 24 sample positions of an ASE apparatus (Accelerated Solvent Extractor ASE 200 with 
solvent controller and AutoASE software (from DIONEX)) were charged with plant samples. 

The polar substances were extracted with about 10 ml of methanol/water {80/20, v/v) at 
T = 70°C and p = 140 bar, 5 min heating period, 1 min static extraction. The more lipophilic 
substances were extracted with about 10 ml of methanol/dichloromethane (40/60, v/v) at 
15 T = 70°C and p = 140 bar, 5 min heating period, 1 min static extraction. Both solvent mixtures 
were extracted into the same sample tubes (50 ml centrifuge tubes with screw cap and piercable 
septum for the ASE (DIONEX)). 

The solution was mixed with internal standards: ribitol, L-glycine-2,2-d 2 , L-alanine-2,3,3,3-d 4 and 
a-methylgtucopyranoside and methyl nonadecanoate, methyl undeuanoate, methyl 
20 tridecanoate, methyl nonacosanoate. 

The complete extract was mixed with 8 ml of water. The solid residue of the plant sample and 
the extraction thimble were discarded. 

The extract was shaken and then centrifuged at a minimum of 1400 g for 5 to 10 min in order to 
speed up phase separation. 1 ml of the supernatant methanol/water phase ("polar phase", 

25 colorless) was removed for further GC analysis, and 1 ml was taken for LC analysis. The 
remainder of the methanol/water phase was discarded. 0.5 ml of the organic phase ("lipid 
phase", dark green) was taken for further GC analysis, and 0.5 ml was takfin fnr LC analysis. All 
the removed aliquots were evaporated to dryness using an IR Dancer infrared vacuum 
evaporator (Hettich). The maximum temperature during the evaporation process did not exceed 

30 40°C. The pressure in the apparatus was not less than 10 mbar. 
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d) Further processing of the lipid phase for LC/MS or LC/MS/MS analysis 

The lipid extract which had been evaporated to dryness was taken up in mobile phase. The 
HPLC run was carried out with gradient elution. 

The polar extract which had been evaporated to dryness was taken up in mobile phase. The 
5 HPLC run was carried out with gradient elution. 

e) Derivatization of the lipid phase for GC/MS analysis 

For the transmethanolysis, a mixture of 140 pi of chloroform, 37 pi of hydrochloric acid (37% by 
weight HCI in water), 320 pi of methanol and 20 pi of toluene was added to the evaporated 
extract. The vessel was tightly closed and heated at 100°C with shaking for 2 h. The solution 
1 0 was then evaporated to dryness. The residue was completely dried. 

The methoximation of the carbonyl groups took place by reaction with methoxyamine 
hydrochloride (5 mg/ml in pyridine, 100 pi in a tightly closed vessel at 60°C for 1 .5 h. 20 pi of a 
solution of odd-numbered, straight-chain fatty acids were added as time standards. Finally, 
100 pi of N-methyl-N-(trimethyl3ilyl)-2,2,2 trifluoroacetamide (MSTFA) were used for 
15 derivatization in the vessel, which was again tightly closed, at 60°C for 30 min. The final volume 
before GC injection was 220 pi. 

f) Derivatization of the polar phase for GC/MS analysis 

The methoximation of the carbonyl groups took place by reaction with methoxyamine 
hydrochloride (5 mg/ml in pyridine, 50 pi in a tightly closed vessel at 60°C for 1.5 h. 10 pi of a 
20 solution of odd-numbered, straight-chain fatty acids were added as time standards. Finally, 50 pi 
of N-methyl-N-(trimethylsilyl)-2,2,2-trifluoroacetamide (MSTFA) were used for derivatization in 
the vessel, which was again tightly closed, at 60°C for 30 min. The final volume before GC 
injection was 110 pi. 

g) Analysis of the various plant samples 

25 The plant samples were measured in single series each of 20 plant samples (so-called 

sequences), each sequence comprising at least 5 wild-type plants as control. The peak aiea or 
the peak height for each analyte was divided by the peak area for the respective internal 
standard. The data was standardized to the initial fresh weight of plant. The values calculated in 
this way were related to the wild-type control group by dividing them by the average of the 

30 corresponding data for the wild-type control group of the same sequence. The resulting values 
were referred to as x-fold, are comparable over all sequences and indicate by how much the 
analyte concentration differs in the mutant relative to the wild-type control. 
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Alternatively, the amino acids can advantageously be detected by HPLC fractionation in ethanol 
extracts by the method of Geigenberger et a!. (Plant Celi & Environ, 19, 1996: 43 - 55). 

The results of the various analyses of the plants are to be found in the following table: 



Analyte No 


Analyte 


Ratio_by_WT 


Ratio_by_median 


GC/LC 


10000032 


Methionine 


3.46-3.58 


3.31-3.4 


LC 


10000034 


Threonine 


0.45-0.15 


0.61-0.15 


LC 


10000006 


Threonine 


0.17-0.16 


0.18-0.16 


GC 


10000008 


Methionine 


3.31-3.67 


3.5-3.53 


GC 



1 0 Column 1 in the table shows the sample number. The analyzed amino acid is to be found in 
column 2. Column 3 shows the ratio for the analyzed amino acid between the transgenic plant 
and the wild type. Column 4 shows the ratio for the transgenic plant compared with the median 
for other transgenic plants not transformed with the threonine aldolase gene. Column 5 shows 
the analytical method. 

15 All the results were revealed to be significant on independent repetition of the analyses. 
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The process for propping amino acids in transgenic organisms, wherein the process 
comprises the following steps: 

a) introduction of a nucleic acid sequence which codes for a threonine- 
degrading protein, or 

b) introduction of a nucleic acid sequence which increases threonine 
degradation in the transgenic organisms, and 

e) expression of a nucleic acid sequence mentioned under (a) or (b) in the 

transgenic organism. 

The process for preparing amino acids in transgenic organisms according to claim 1 , 
wherein there is introduction in process step (a) as set forth in claim 1 of a nucleic acid 
sequence which is selected from the group of nucleic acid sequences: 

i) of a nucleic acid sequence having the sequence depicted in SEQ ID NO: 1 ; 

ii) of a nucleic acid sequence obtained owing to the degeneracy of the genetic 
code through back-translation of the amino acid sequence depicted in SEQ ID 
NO: 2, and 

iii) of a derivative of the nucleic acid sequence depicted in SEQ ID NO: 1 which 
codes for polypeptides having the amino acid sequence depicted in SEQ ID NO: 
2 and have at least 50% homology at the amino acid level, with a negligible 
reduction in llie biological activity of the polypeptides. 

The process for preparing amino acids in transgenic organisms according to claim 1, 
wherein there is introduction in process step (a) as set forth in claim 1 of a nucleic acid 
sequence which is selected from the group of nucleic acid sequences: 

i) of a nucleic acid sequence obtained owing to the degeneracy of the genetic 
code through back-transiation of the amino acid sequence depicted in SEQ ID 
NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID 
NO: 8, SEQ ID NO: 9 or SEQ ID NO: 10; 

ii) of a derivative of the nucleic acid sequence which is obtained by back-translation 
of the amino acid sequence depicted in SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID 
NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or SEQ ID 
NO: 1 0 and which has at least 70% homology at the amino acid level with the 
aforementioned amino acid sequences, with a negligible reduction in the 
biological activity of the polypeptides. 
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4. The process for preparing amino acids in transgenic organisms according to claims 1 to 

3, wherein the transgenic organism is cultivated and harvested after introduction and 
expression of the nucleic acid. 

5. The process for preparing amino acids in transgenic organisms according to claims 1 to 

4, wherein the amino acid is isolated from the organism or the culture medium or the 
organism and the culture medium. 

6. The process for preparing amino acids in transgenic organisms according to claims 1 to 
3, wherein the amino acid is selected from the group of methionine, homoserine and 
lysine. 

7. The process for preparing amino acids in transgenic organisms according to claims 1 to 

6. wherein the essential amino acid methionine is involved. 

8. The process for preparing amino acids in transgenic organisms according to claims 1 to 

7. wherein the transgenic organism is a microorganism or a plant. 

9. The process for preparing amino acids in transgenic organisms according to claims 1 to 

8. wherein the transgenic organism is a microorganism selected from the group of 
genera Corynebacterium, Brevibacterium, Escherichia, Bacillus, Rhodotorula, 
Hansenula, Schizosaccharomyces, Saccharomyces, Candida, Claviceps or 
Flavobacterium. 

10. The process for preparing amino acids in transgenic organisms according to claims 1 to 

8, wherein the transgenic organism is a plant selected from the qroup of crop plants. 

11. The process for preparing amino acids in transgenic organisms according to claim 1 0, 
wherein the transgenic organism is a plant selected from the group of peanut, oilseed 
rape, canola, sunflower, safflower, olive, sesame, hazelnut, almond, avocado, bay, 
pumpkin, lettuce, flax, soybean, pistachio, borage, corn, wheat, rye, oats, millet, triticaie, 
r j Ctt: bai ley, cassava, potato, sugar beet, feed beet, aubergine, tomato, pea, alfaalfa and 
perennial grasses and feed crops. 

1 2. The process for preparing amino acids in transgenic organisms according to claims 1 to 

1 1 , wherein the nucleic acid sequence is derived from a eukaryote. 

13. The process for prepanng amino acids in transgenic organiamo according to claims 1 to 

12, wherein the nucleic acid sequence is derived from the genus Saccharomyces. 

14. The process for preparing amino acids in transgenic organisms according to claims 1 to 

13, wherein the nucleic acid sequence is for introduction and for expression 
incorporated into a nucleic acid construct or a vector. 
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15. The process for preparing amino acids in transgenic organisms according to claims 1 to 
14, wherein additionally biosynthesis genes of the amino acid prepared in the process 
are introduced into the organism. 

16. A nucleic acid construct comprising a nucleic acid sequence as set forth in claim 2 or 3, 
which is functionally linked to one or more regulatory signals. 

17. A vector comprising a nucleic acid sequence as set forth in claim 2 or 3 or a nucleic acid 
construct as set forth in claim 16. 

18. A transgenic prokaryotic or eukaryotic organism comprising at least one nucleic acid 
sequence as set forth in claim 2 or 3 or at least one nucleic acid construct as set forth in 
claim 1 6 or at least one vector as set forth in claim 1 7. 

19. The transgenic prokaryotic or eukaryotic organism according to claim 18, which is a 
microorganism or a plant. 

20. The transgenic prokaryotic or eukaryotic organism according to claim 19, which is a 
microorganism of the genus Corynebacterium or Brevibacterium. 

21. The transgenic prokaryotic or eukaryotic uiyanism according to claim 19, which i3 a 
plant selected from the group of genus of peanut, oilseed rape, canola, sunflower, 
safflower, olive, sesame, hazelnut, almond, avocado, bay, pumpkin, lettuce, flax, 
soybean, pistachio, borage, corn, wheat, rye, oats, millet, triticale, rice, barley, cassava, 
potato, sugar beet, feed beet, aubergine, tomato, pea, alfaalfa and perennial grasses 
and feed crops. 

22. The use of the transgenic organisms as set forth in claims 1 8 to 21 or of an amino acid 
prepared by a process as set forth in claims 1 to 15 for producing an animal or human 
food, for producing cosmetics or pharmaceuticals. 

23. An amino acid sequence selected from the group of sequences SEQ ID NO: 3, SEQ ID 
NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 or 
SEQ ID NO: 10 
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Process for preparing amino acids 
Abstract 

A process for preparing amino acids in transgenic organisms, wherein the process comprises 
the following steps: 

5 a) introduction of a nucleic acid sequence which codes for a threonine-degrading 

protein, or 

b) introduction of a nucleic acid sequence which increases threonine degradation in the 
transgenic organisms, and 

c) expression of a nucleic acid sequence mentioned under (a) or (b) in the transgenic 
10 organism. 

Advantageously, there is introduction in process step (a) of a nucleic acid sequence selected 
from the group 

i) of a nucleic acid sequence having the sequence depicted in SEQ ID NO: 1 ; 

ii) of a nucleic acid sequence obtained owing to the degeneracy of the genetic 
code through back-translation of the amino acid sequence depicted in SEQ ID 
NO: 2, and 

iii) of a derivative of the nucleic acid sequence depicted in SEQ ID NO: 1 which 
codes for polypeptides having the amino acid sequence depicted in SEQ ID NO: 
2 and have at least 50% homology at the amino acid level, with a negligible 
reduction in the biological activity of the polypeptides. 
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SEQUENZ PROTOKOLL 

<110> Metanomics GmbH & Co. KGaA 

<120> Verfahren zur Hers tel lung von Aminosauren 

<130> 2002_960 

<140> PF54195 
<141> 2002-12-20 

<160> 10 

<170> Patentln Vers, 2.0 

<210> 1 
<211> 1164 
<212> DNA 

<213> Saccharomyces cerevisiae 

<220> 

<221> CDS 

<222> (1) . . (1164) 

<223> Thjreonin-AldolaEe 

<400> 1 

atg act gaa ttc gaa ttg cct cca aaa tat ate acc- get get aac gac 48 

Met Thr Glu Fhe Glu Leu Fro pro bys Tyr He Thr Ala Ala Asn Asp 

15 10 15 

ttg egg tea gac aca ttc acc act cca act gca gag atg atg gag gec 96 
Leu Arg Ser Asp Thr Phe Thr Thr Pro Thr Ala Glu Met Met Glu Ala 

20 25 30 

get tta gag gec tct ate ggt gac get gtc tac ggt gaa gat gtt gac ldd 
Ala Leu Glu Ala Ser lie Gly Asp Ala Val Tyr Gly Glu Asp Val Asp 
35 40 45 

acc gtt agg etc gaa cag acc gtt gec cgc atg get ggc aaa gaa gca 192 
Thr Val Arg Leu Glu Gin Thr Val Ala Arg Met Ala Gly Lys Glu Ala 
50 55 60 

ggt ttg ttc tgt gtc tct ggg act ttg tec aac cag att gec ate aga 240 
Gly Leu Phe Cys Val Ser Gly Thr Leu Ser Asn Gin lie Ala lie Arg 
65 70 75 80 

act cac ttg atg caa cct cca tac tct att eta tgt gat tac agg get 288 
Thr His Leu Met Gin Pro Pro Tyr Ser lie Leu Cys Asp Tyr Arg Ala 
85 90 95 

cac gtt tac act cac gaa gec get gga ctg gcg ate ttg tct caa gcg 336 
His Val Tyr Thr His Glu Ala Ala Gly Leu Ala lie Leu Ser Gin Ala 
100 105 110 

atg gtg gtt cct gtg gtt cct tec aac ggt gac tac ttg acc ttg gaa 384 
Met Val Val Pro Val Val Pro Ser Asn Gly Asp Tyr Leu Thr Leu Glu 
115 120 125 

gac ate aag tea cac tac gtc cca gac gac ggt gat att cac ggt gec 432 
Asp lie Lys Ser His Tyr Val Pro Asp Asp Gly Asp He His Gly Ala 

130 135 140 

ccc acc aga ttg att tct ctg gaa aac act tta cac ggt att gtt tat 480 
1 
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Pro Thr Arg Leu lie Ser Leu Glu Asn Thr Leu His Gly lie Val Tyr 
145 150 155 1G0 

cca ttg gaa gaa ctg gtc cgc ate aaa get tgg tgt atg gaa aat ggt 528 
Pro Leu Glu Glu Leu Val Arg lie Lys Ala Trp Cys Met Glu Asn Gly 
165 170 175 

etc aaa eta cat tgt gac ggt gec aga ate tgg aat gec get gca caa 576 
Leu Lys Leu His Cys Asp Gly Ala Arg lie Trp Asn Ala Ala Ala Gin 
180 185 190 

tct ggc gtg cca tta aag caa tat ggg gaa ate ttc gac tec ate tec 624 
Ser Gly Val Pro Leu Lys Gin Tyr Gly Glu lie Phe Asp Ser lie Ser 
195 200 " 205 

ate tgt eta tec aag tct atg ggt get cct att ggg tec gtc ttg gtt 672 
lie Cys Leu Ser Lys Ser Met Gly Ala Pro lie Gly Ser Val Leu Val 
210 215 220 

ggg aac ctt aag ttt gtc aag aag gec acc cat ttc aga aaa caa caa 720 
Gly Asn Leu Lys Phe Val Lys Lys Ala Thr His Phe Arg Lys Gin Gin 
225 230 235 240 

ggt ggt ggt att aga caa tct ggt atg atg get aga atg get ctt gta 7 58 
Gly Gly Gly He Arg Gin Ser Gly Met Met Ala Arg Met Ala Leu Val 
245 250 255 

aac ate aac aac gat tgg aag tec caa ttg ctg tac teg cac tct ttg a 16 
Asn He Asn Asn Asp Trp Lys Ser Gin Leu Leu Tyr Ser His Ser Leu 
260 265 270 

get cat gaa tta gec gaa tat tgt gag gca aag ggc ate ccg eta gag 864 
Ala His Glu Leu Ala Glu Tyr Cys Glu Ala Lys Gly He Pro Leu Glu 
275 280 285 

tct cca gca gac acc aac ttt gtc ttt att aac ctg aag gec get aga 912 
Ser Pro Ala Asp Thr Asn Phe Val Phe He Asn Leu Lys Ala Ala Arg 
290 295 300 

atg gac cca gat gtc ctt gtt aag aag ggt ttg aag tac aac gtt aag 960 
Met Asp Pro Asp Val Leu Val Lys Lys Gly Leu Lys Tyr Asn Val Lys 
305 310 315 320 

eta atg ggt ggt aga gtc teg ttc cac tat caa gtc acc aga gat act 1008 
Leu Met Gly Gly Arg Val Ser Phe His Tyr Gin Val Thr Arg Asp Thr 
325 330 335 

ttg gaa aaa gtc aaa ttg gee ate tec gag gec ttc gac tat get aaa 1056 
Leu Glu Lys Val Lys Leu Ala He Ser Glu Ala Phe Asp Tyr Ala Lys 

340 345 350 

gaa cat cct ttc gac tgt aac gga cct acc cag att tac cgt agt gaa 1104 
Glu His Pro Phe Asp Cys Asn Gly Pro Thr Gin He Tyr Arg Ser Glu 
355 360 365 

tec acc gag gtc gac gtt gat ggc aac get ate cgc gaa ata aaa acc 1152 
Ser Thr Glu Val Asp Val Asp Gly Asn Ala He Arg Glu He Lys Thr 
370 375 380 

tac aaa tac tga 1164 

Tyr Lys Tyr 
385 



2 
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<210> 2 
<211> 387 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 2 

Met Thr Glu Phe Glu Leu Pro Pro Lys Tyr lie Thr Ala Ala Asn Asp 
1 5 10 15 

Leu Arg Ser Asp Thr Phe Thr Thr Pro Thr Ala Glu Met Met Glu Ala 
20 25 30 

Ala Leu Glu Ala Ser He Gly Asp Ala Val Tyr Gly Glu Asp Val Asp 
35 40 45 

Thr Val Arg Leu Glu Gin Thr Val Ala Arg Met Ala Gly Lys Glu Ala 
50 55 60 

Gly Leu Phe Cys Val Ser Gly Thr Leu Ser Asn Gin He Ala He Arg 



Thr His Leu Met Gin Pro Pro Tyr Ser He Leu Cys Asp Tyr Arg Ala 
85 90 95 

His Val Tyr Thr His Glu Ala Ala Gly Leu Ala He Leu Ser Gin Ala 
100 105 110 

Met Val Val Pro Val Val Pro Ser Asn Gly Asp Tyr Leu Thr Leu Glu 
115 120 125 

Asp He Lys Ser His Tyr Val Pro Asp Asp Gly Asp He His Gly Ala 
130 135 140 



Pro Leu Glu Glu Leu Val Arg He Lys Ala Trp Cys Met Glu Asn Gly 
165 170 175 

Leu Lys Leu His Cys Asp Gly Ala Arg He Trp Ash Ala Ala Ala Gin 
180 185 190 

Ser Gly Val Pro Leu Lys Gin Tyr Gly Glu He Phe Asp Ser He Ser 
195 200 205 

He Cys Leu Ser Lys Ser Met Gly Ala Pro He Gly Ser Val Leu Val 
210 215 220 



Ser Pro Ala Asp Thr Asn Phe Val Phe He Asn Leu Lys Ala Ala Arg 
290 295 300 
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Met Asp Piu Asp Veil Leu Veil Lys Lys Gly L«u Lys Tyi Asn Val Lys 
305 310 315 320 

Leu Met Gly Gly Arg Val Ser Phe His Tyr Gin Val Thr Arg Asp Thr 
325 330 335 



Glu His Pro Phe Asp Cys Asn Gly Pro Thr Gin lie Tyr Arg Ser Glu 
355 360 365 

Ser Thr Glu Val Asp Val Asp Gly Asn Ala He Arg Glu He Lys Thr 
370 375 380 



<210> 3 
<211> 376 
<212> PRT 
■^213^ Soya 

<400> 3 

Gly Cys Phe Ala Cys Tyr Leu Val Gly Gly Phe Ser Val Gin Glu Lys 
15 10 15 

Met Val Thr Arg He Val Asp Leu Arg Ser Asp Thr Val Thr Lys Pro 
20 25 30 

Thr Glu Ala Met Arg Ala Ala Met Ala Ser Ala Glu Val Asp Asp Asp 
35 40 45 

Val Leu Gly Tyr Asp Pro Thr Ala Phe Arg Leu Glu Thr Glu Met Ala 
50 55 60 

Lys Thr Met Gly Lys Glu Ala Ala Leu Phe Val Pro Ser Gly Thr Met 



Gly Asn Leu Val Ser Val Leu Val His Cys Asp Val Arc; Gly Ser Glu 
85 90 95 



He Ala Thr He Gly Gly Val His Pro Arg Gin Val Lys Asn Asn Asp 
115 120 125 

Asp Gly Thr Met Asp He Asp Leu He Glu Ala Ala He Arg Asp Pro 
130 135 140 

Met Gly Glu Leu Phe Tyr Pro Thr Thr Lys Leu He Cys Leu Glu Asn 
145 150 155 160 



Gly Ala Arg He Phe Asn Ala Ser Val Ala Leu Gly Val Pro Val Asp 
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195 



200 



205 



Arg Leu Val Gin Ala Ala Asp Ser Val Ser Val Cys Leu Ser Lys Gly 
210 215 220 

lie Gly Ala Pro Val Gly Ser Val He Val Gly Ser Lys Asn Phe He 
225 230 235 240 

Ala Lys Ala Arg Arg Leu Arg Lys Thr Leu Gly Gly Gly Met Arg Gin 
245 250 255 

He Gly Leu Leu Cys Ala Ala Ala Leu Val Ala Leu Gin Glu Asn Val 
260 265 270 

Gly Lys Leu Glu Ser Asp His Lys Lys Ala Arg Leu Leu Ala Asp Gly 
275 280 285 

Leu Asn Glu Val Lys Gly Leu Arg Val Asp Ala Cys Ser Val Glu Thr 
290 295 300 

Asn Met Val Phe He Asp He Glu Glu Gly Thr Lys Thr Arg Ala Glu 
305 310 315 320 

Lys He Cys Lys Tyr Met Glu Glu Arg Gly He Leu Val Met Gin Glu 
325 330 335 

Ser Ser Ser Arg Met Arg Val Val Leu His His Gin He Ser Ala Ser 
340 345 350 

Asp Val Gin Tyr Ala Leu Ser Cys Phe Gin Gin Ala Leu Ala Val Lys 
355 360 365 

Gly Val Gin Lys Glu Met Gly Asn 
370 375 



<210> 4 
<211> 115 
<212> PRT 
<213> Soya 

<400> 4 

Leu Phe Gly Leu Leu Ala He Leu Leu Glu Tyr Leu Glu Lys Met Val 
15 10 15 

Pro Arg He Val Asp Leu Arg Ser Asp Thr Val Thr Lys Pro Ser Glu 
20 25 30 

Ala Met Arg Ala Ala Met Ala Ser Ala Glu Val Asp Asp Asp Val Leu 
35 40 45 

Gly Arg Asp Pro Ser Cys Phe Arg Leu Glu Thr Glu Met Ala Lys He 
50 55 60 

Leu Gly Lys Glu Gly Ala Leu Phe Val Pro Ser Gly Thr Met Ala Asn 
65 70 75 80 

Leu He Ser Val Leu Val His Cys Asp He Arg Gly Ser Glu Val He 
85 90 95 

Leu Gly Asp asn ser His He His He Tyr Glu Asn Gly Gly He Ala 



100 



105 



110 
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Thr Leu Gly 
115 



<210> 5 
<211> 127 
<212> PRT 
<213> Soya 



<400> 5 

Lys Thr Leu Xaa Gly Gly Met Arg Gin Val Gly He Leu Cys Ala Ala 
15 10 15 

Ala Leu Val Ala Leu Gin Glu Asn Val Gly Lys Leu Gin Ser Asp His 
20 25 30 

Asn Lys Ala Lys Leu Leu Ala Asp Gly Leu Asn Glu lie Lys Gly Leu 
35 40 45 

Arg Val Asp He Ser Ser Val Glu Thr Asn He He Tyr Val Glu Val 
50 55 60 

Glu Glu Gly Ser Arg Ala Thr Ala Ala Lys Leu Cys Lys Asp Leu Glu 
65 70 75 80 

Asp Tyr Gly He Leu Leu Met Pro Met Gly Ser Ser Arg Leu Arg He 
85 90 95 

Val Phe His His Gin He Ser Ala Ser Asp Val Gin Tyr Ala Leu Ser 
100 105 110 

Cys Phe Gin Gin Ala Val Asn Gly Val Arg Asn Glu Asn Gly Asn 
115 120 125 



<210> 6 
<211> 147 
<212> PRT 
<213> Rice 

<400> 6 

Gly Arg Arg Phe Arg Ala He Arg Asp Pro Met Gly Glu Leu Phe Tyr 
15 10 15 

Pro Thr Thr Lys Leu He Cys Leu Glu Asn Thr His Ala Asn Ser Gly 
20 25 30 

Gly Arg Cys Leu Ser Val Glu Tyr Thr Asp Arg Val Gly Glu Leu Ala 
35 40 45 

Lys Lys His Gly Leu Lys Leu His He Asp Gly Ala Arg He Phe Asn 



Ala Ser Val Ala Leu Gly Val Pro Val Asp Arg Leu Val Gin Ala Ala 

65 70 75 80 

Asp Ser Val Ser Val Cys Leu Ser Lys Gly lie Gly Ala Pro Val Gly 

85 90 95 



Arg Lys Thr Leu Gly Gly Gly Met Arg Gin He Gly Leu Leu Cys Ala 
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115 120 125 

Ala Ala Leu Val Ala Leu Gin Glu Asn Val Gly Lys Leu Glu Ser Asp 
130 135 140 

His Lys Lys 
145 



<210> 7 
<211> 169 
<212> PRT 
•=213=- Rice 



<400> 7 

Gly lie Pro Gly Xaa Thr Phe Arg Gly Asp Val Ala Lys Ser His Gly 
15 10 15 

Leu Lys Leu His He Asp Gly Ala Arg He Phe Asn Ala Ser Val Ala 
20 25 30 

Leu Gly Val Pro Val His Arg Leu Val Lys Ala Ala Asp Ser Val Ser 
35 40 45 

Val Cys He Ser Lys Gly Leu Gly Ala Pro Val Gly Ser Val He Val 
50 55 60 

Gly Ser Thr Ala Phe He Glu Lys Ala Lys He Leu Thr Lys Thr Leu 



Gly Gly Gly Met Arcr Gin Val Gly He Leu Cys Ala Ala Ala Tyr Val 
85 90 95 

Ala Val Arg Asp Thr Val Gly Lys Leu Ala Asp Asp His Arg Arg Ala 
100 105 110 

Lys Val Leu Ala Asp Gly Leu Lys Lys He Lys His Phe Arg Val Asp 
115 120 125 

Thr Thr Ser Val Glu Thr Asn Met Val Phe Phe Asp He Val Asp Ser 
130 135 140 

Arg He Ser Pro Asp Lys Leu Cys Gin Val Leu Glu Gin Arg Asn Val 
145 150 155 160 



<210> 8 
<211> 362 
<212> PRT 
<213> Canola 

<400> 8 

He Glu He Lys Met Val Met Arg Thr Val Asp Leu Arg Ser Asp Thr 

15 10 15 

Val Thr Arg Pro Thr Asp Ala Met Arg Glu Ala Met Gly Ser Ala Glu 
20 25 30 

Val Asp Asp Asp Val Leu Gly Tyr Asp Pro Thr Ala Arg Arg Leu Glu 
35 40 45 



7 
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Glu Glu lie Ala Lys Met Met Gly Lys Glu Ala Ala Leu Phe Val Pro 

50 55 60 

Ser Gly Thr Met Gly Asn Leu lie Cys Val Met Val His Cys Asp Val 



Arg Gly Ser Glu Val lie Leu Gly Asp Asn Cys His lie His Val Tyr 
85 90 95 

Glu Asn Gly Gly He Ser Thr He Gly Gly Val His Pro Lys Thr He 
100 105 110 

Lys Asn Glu Glu Asp Gly Thr Met Asp Leu Gly Ala He Glu Ala Ala 
115 120 125 

He Arg Asp Pro Lys Gly Ser Thr Phe Tyr Pro Ser Thr Arg Leu lie 
130 135 140 



Leu His He Asp Gly Ala Arg Leu Phe Asn Ala Ser He Ala Leu Gly 
180 185 190 

Val Pro Val His Arg Leu Val Gin Ala Ala Asp Ser Val Ser Val Cys 
195 200 205 



Gly Met Arg Gin He Gly Val Leu Cys Ala Ala Ala Leu Val Ala Leu 
245 250 255 

Gin Glu Asn Leu Pro Lys Leu Gin Phe Asp His Lys Lys Thr Lys Leu 
260 265 270 

Leu Ala Glu Gly Leu Asn Gin Met Lys Gly He Arg Val Asn Val Ala 
275 280 285 

Ala Met Glu Thr Asn Met He Phe Met Asp Met Glu Asp Gly Ser Lys 
290 295 300 



<210> 9 
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<211> 196 
<212> FRT 

<213> Canola 
<400> 9 

Gly Phe Leu Leu Lys His Lys Tyr lie Tyr Tyr Cys Cys Tyr Leu Phe 
15 10 15 

Glu Ser Lys S(?r- Asri Asn Phf=> T.p-n Phe> Ser Val Tip T.ys Met: Va1 Thr 
20 25 30 

Pro Val lie Arg Thr Val Asp Leu Arg Ser Asp Thr Val Thr Lys Pro 
35 40 45 

Thr Glu Ser Met Arg Ser Ala Met Ala Asn Ala Glu Val Asp Asp Asp 
50 55 60 

Val Leu Gly Asn Asp Pro Thr Ala Val Leu Leu Glu Arg Glu Val Ala 
65 70 75 80 

Glu lie Ala Gly Lys Glu Ala Ala Met Phe Val Pro Ser Gly Thr Met 
85 90 95 

Gly Asn Leu lie Ser Vol Leu Val His Cys Asp Glu Arg Gly Ser Glu 
100 105 110 

Val He Leu Gly Asp Asp Ser His He His He Tyr Glu Asn Gly Gly 
115 120 125 

Val Ser Ser Leu Gly Gly Val His Pro Arg Thr Val Lys Asn Glu Glu 
130 135 140 

Asp Gly Thr Met Glu He Ser Ser He Glu Ala Ala Val Arg Ser Pro 
145 150 155 160 

Thr Gly Asp Leu His Tyr Pro Val Thr Lys Leu He Cys Leu Glu Asn 
165 170 175 

Thr Gin Ala Asn Cys Gly Gly Arg Cys Leu Pro He Glu Tyr lie Asp 
180 185 190 

Lys Val Gly Glu 
195 

<210> 10 
<211> 104 
<212> PUT 
<213> Canola 

<400> 10 

He Gly He Lys Met Val Met Arg He Val Asp Leu Arg Ser Asp Thr 
15 10 15 

Val Thr Arg Pro Thr Asp Ala Met Arg Glu Ala Met Ala Ser Ala Glu 
20 25 30 

Val Acp Acp Acp Val Leu Gly Tyr Acp Pro Thr Ala Arg Gly Leu Glu 
35 40 45 

Glu Glu Met Ala Lys Met Met Gly Lys Glu Ala Ala Leu Phe Val Pro 
50 55 60 



Ser Gly Thr Met Gly Asn Leu He Cys Val Met Val His Cys Asp Val 
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Arg Gly Ser Glu Val He Leu Gly Asp Thr Cys His He His Val Tyr 
85 90 95 

Glu Asn Gly Gly He Ser Thr He 
100 



10 



